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SUMMARY OF THE INVENTION 

Novel water-soluble ligand-binding proteins have been identified and isolated, which 
have a ligand-binding profile substantially similar to that of ligand-gated ion channels. 
DNA molecules encoding such proteins have been cloned and characterized. The 

15 biological and structural properties of these proteins are disclosed, as is the amino 
acid and nucleotide sequence. The recombinant DIMA molecules, and portions 
thereof, are useful for isolating homologues of the DNA molecules, identifying and 
isolating genomic equivalents of the DNA molecules, and identifying, detecting or 
isolating mutant forms of the DNA molecules. Using a recombinant expression 

20 system functional DNA molecules encoding the water-soluble ligand-binding proteins 
as well as chimeras have been functionally produced. Furthermore, the water-soluble 
ligand-binding proteins could be crystallized revealing the three dimensional (3D) 
structure and enabling the modeling of the 3D structure of the ligand-binding domain 
of ligand-gated ion channels. The invention is further in the field of the development 

25 of new drugs that are capable of selectively intervening in neuronal signaling 
pathways. The invention is more in particular concerned with providing new 
analogues of the channel-coupled receptors, crystal structures thereof and to their 
use in screening ligands for these receptors. 

30 Several documents are cited throughout the text of this specification either by name 
or are referred to by numerals within parenthesis. Full bibliographic citations may be 
found at the end of the specification immediately preceding the claims. Each of the 
documents cited herein (including any manufacturer's specifications, instructions, 
etc.) are hereby incorporated herein by reference; however, there is no admission 

35 that any document cited is indeed prior art as to the present invention. 
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BACKGROUND OF THE INVENTION 

The communication in the central nervous system (CNS) occurs through a complex 
interaction of electrical and chemical signals. Molecules bearing chemical information 
are called neurotransmitters. The chemical information is converted in electric 
5 currents on the post-synaptic membrane, which is specialised in recognising and 
binding neurotransmitters by means of protein receptors. The specific binding of a 
ligand to one type of such receptors, the ionotropic receptors, induces a fast opening 
of the ion channel coupled to the receptor. An important group of ionotropic receptors 
is the superfamily of the channel-coupled receptors, also referred to as ligand-gated 

10 receptors, including the 7-amino~butyric acid (GABA A ) receptor, the glycine receptor, 
the serotonin-3 (5-HT3) receptor and both neuronal and muscle-type nicotinic 
acetylcholine receptors (nAChR). These receptors share certain structural features 
such as (1) a 15-residue cysteine loop between amino acids 128 and 142 
corresponding to the Torpedo AChR a unit, (2) four trans-membrane domains, (3) 

15 similar subunit arrangements, and (4) homologies in amino acid sequence. Activation 
of these receptors causes a change in electrical current and hyperpolarisation of the 
cell membrane and consequently an inhibition of the electrical activity of the cell. The 
GABA A receptor and the glycine receptor are coupled to a chloride-selective channel, 
and thus the inhibition of the electrical activity leads to inhibition of the cell response. 

20 On the other hand, activation of the 5-HT3 receptor and the nAChRs provokes an 
excitatory response on the cell because they are connected to a cation-selective 
channel (Na+, K+, Ca2+). The AChRs are the best studied of the ligand-gated 
receptors; for a review, see Arias, Brain Research Reviews, 25 (1997)133-191 and 
Arias, Neurochem. Int. 36 (2000), 595-645). Mutations in these ligand-gated ion- 

25 channels (LGICs) lead to diseases such as congenital myasthenia gravis, epilepsy, 
startle syndrome and alcohol sensitivity (Vafa and Schofield, Int. Rev. Neurobiol. 42, 
285-332; 1998). NAChRs mediate nicotine addiction in chronic tobacco users. Since 
nicotine binding to these receptors also has a positive effect on Alzheimer's disease, 
Parkinson's disease and schizophrenia these receptors present an important drug 

30 target (Paterson and Nordberg, A. Neuronal nicotinic receptors in the human brain. 
Prog. Neurobiol. 61, 75-111; 2000). 

The development of new active compounds that can selectively or - as the case may 
be - a-selectively bind to the channel-coupled receptors, is of utmost importance for 
the understanding of the processes occurring in the nervous system and for the 
35 treatment of disturbances of neural conditions. The" development of such active 
compounds requires the availability of a reliable model system for the corresponding 
receptors. The primary structural features (amino acid sequences) of the various 
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receptors have been largely elucidated by now. Certain subunits of the AChRs have 
been found to be determinant in the pharmacological specificity or affinity of the 
receptor for its ligand (Corringer et al., J. Neuroscience 18 (1998), 648-657), 
However, the study of the ligand binding properties of the receptor proteins is 
hampered by the fact that the spatial structure of the proteins - which is decisive in 
the binding of ligands - is still unknown. This is partly because crystallisation of the 
receptor proteins has been unsuccessful up to now. 

The above-defined technical problem is solved by the present invention by providing 
the embodiments characterized in the claims. 

Accordingly, in one aspect the present invention relates to a water-soluble protein 
derived from a mollusc being capable of binding a ligand of a ligand-gated receptor. 

It has been found according to the invention that acetylcholine-binding proteins 
(AChBP) of certain molluscs show a surprising structural similarity with the channel- 
15 coupled receptors on the one hand and have interesting physical properties, such as 
water-solubility, on the other hand. The molluscan AChBPs are capable of forming 
multimers, especially pentamers, and of binding specific, toxins such as ct- 
bungarotoxin. These multimers may be homogeneous (identical units) or 
heterogeneous (different units). These properties make them eminently suitable as 
20 model systems for studying the binding of candidate ligands to the channel-coupled 
receptors. It has been possible to produce these molluscan AChBPs in recombinant 
systems, thus allowing convenient and large-scale production thereof. Moreover, it is 
feasible to construct hybrid proteins sharing the physical properties of the mollusc 
AChBP with the pharmacological properties of the (human) channel-coupled 
25 receptors, thus providing new dedicated tools for screening ligands for these 
receptors. 

The AChBP is a naturally occurring analogue of the extracellular domains of the ct- 
subunits of the neuronal nicotinic acetylcholine receptors (nAChRs). In contrast to the 

30 nAChRs, it lacks domains to form a transmembrane ion channel, but alike the 
nAChRs it assembles into a homo-pentamer (Figure 6). Moreover, AChBP has 
ligand-binding characteristics that are typical for a nicotinic receptor. The 3- 
dimensional structure of AChBP was solved by X-ray crystallography at 2.7A 
resolution (current R (ac tor = 27.9 %, R !ree = 30.0 %). In crystals, as in solution, AChBP 

35 forms a stable homo-pentamer with dimensions comparable to those of the ligand- 
binding domain of the nAChR, as determined in EM studies by Unwin, Struct. Struct. 
Biol 121 (1998), 181-190. The high-resolution crystal structure of AChBP, along with 



5 



10 



AO 01-58951 



Page 5 of 259 



WO 01/58951 PCT/EP01/01457 

-4 

biochemical and pharmacological data, supports the extrapolation of AChBP as a 
good mimic of ligand-binding domains of ligand-gated ion channels including nAChR, 
5-HT3R, GABAa, c R and GlyR. 

Four AChBPs according to the present invention are exemplified herein, isolated and 

5 cloned from the CNS of Lymnaea stagnalis (L-AChBP_T1 and L-AChBPJ"2) and 
Bulinus truncatus (B-AChBP_T1 and B-AChBP_T2). L-AChBP_T1 and 2 are 229 
amino acid proteins with a signal sequence of 1 9 amino acids (224 and 21 amino 
acids, respectively, for B-AChBP_T1 and 2; see also Figure 1) and have sequence 
homology with the extracellular domains of the subunits of ligand-gated ion channels 

10 (Figure 3), in particular with those of the nAChRs (Figure 4 and 5). The mass of the 
purified AChBP from Lymnaea has been determined by mass-spectrometry. The 
glycosylated form has a mass of about 24720 Da and the de-glycosylated form of 
about 23832 Da. In SDS-PAGE the glycosylated AChBP migrates between the 14 
and 26 kDA marker proteins. Hydrophopicity plots of the AChBPs are shown in 

15 Figure 2, which reveal those regions of the ligand-binding proteins that are 
particularly hydrophilic and thus may be replaced at least in part or essential amino 
acids thereof in the ligand-binding domain of the ligand-gated ion channel. Sequence 
conservation is particularly high in the so-called loop areas (reviewed by Arias, 
Neurochem. Int. 36 (2000), 595-645), which contain the residues involved in ligand- 

20 binding. The cysteine residues characteristic for the Cys-loop family of ligand gated 
receptors are conserved in AChBP. Also the double cysteine typically found in the 
alpha subunits of the nAChR is present. AChBP protein sequence ends at the 
position where in the nAChRs the first predicted transmembrane domain would start. 
The ligand-binding characteristics of AChPBs are described in Example 4 and 

25 summerized in Table 2. 

The terms "channel coupled receptors", "ligand-gated receptor", "ligand-gated ion 
channel" are used interchangeable herein. However, in context with the natural 
occurring, in particular human molecules the term "ligand-gated ion channel" is 

30 preferably used. The water-soluble ligand-binding protein of the invention can also be 
characterized as a ligand-binding protein having at least 10%, more preferably at 
least 12%, still more preferably at least 1 5% and most preferably at least 20% amino 
acid sequence identity to a vertebrate ligand-gated ion channel but missing any 
trans-membrane domain. A ligand-gated receptor of the present invention is 

35 characterized by having substantially the same ligand-binding characteristics of a 
vertebrate, preferably mammalian, most preferably human ligand-gated ion channel 
but comprising at least one alteration 'in the original amino acid sequence, said 



,VO0V589h1 



Page 6 of 259 



WO 01/58951 PCT/EP01/01457 

-5 

alteration resulting in the presence of an amino acid determining or contributing to 
the water-solubility of the water-soluble ligand-protein found in molluscs, in particular 
snails such as those described in more detail below. 

5 The terms "ligand-binding protein", Tigand-binding domain" and Tigand-binding 
receptor" are meant to at least include the portion of a water-soluble ligand-binding 
protein or corresponding modified ligand-gated ion channel required for binding a 
ligand. Minimally the ligand-binding domain consists of a peptide containing that 
domain. However the use of this term is meant to include a ligand- binding domain or 
10 protein that is comprised by a larger portion of, for example, ligand-gated ion 
channel, such as a fully reconstituted nicotinic acetylcholine receptor. 

As shown in Figure 3 the nicotinic acetylcholine receptor (nAChR) belongs to a well- 
understood member of the ligand-gated ion channels superfamily. The members of 

15 this signaling protein group, including 5-HT3, glycine, GABA A , and GABAc receptors, 
are thought to share common secondary, tertiary, and quaternary structures on the 
basis of a high degree of sequence similarity. Therefore, it is expected that the novel 
findings in respect to the exemplified AChBP equally apply to the other members of 
the mentioned ligand-gated ion channels superfamily. Thus, either water-soluble 

20 protein being capable of binding a ligand of any of those ligand-gated ion channels 
may be found in molluscs or the present 5-HT3, GABA A , and glycine receptors can 
be modified such as to substantially retain their binding affinity. 
Accordingly, the ligand of the water-soluble ligand-binding protein is preferably 
acetylcholine, gamma-amino-butyric acid (GABA), glycine, nicotine or serotonin. 

25 Isolation of such water-soluble ligand-binding proteins can be done as described in 
Example 1 for the AChBP of the present invention. Instead of a-bungarotoxin other 
known iigands or can be used for affinity purification. Most preferably, water-soluble 
ligand-binding protein of the invention is a acetylcholine-binding protein (AChBP). 
Preferably, the ligand-binding protein displays substantially the binding 

30 characteristics shown in Table 2. 

The acetylcholine-binding proteins to be used according to the invention are originally 
derived from aquatic molluscan species, especially species from the class of the 
snails (Gastropoda), in particular from the order of the lunged snails (Pulmonata), 
The order of the Pulmonata is divided into the suborders of the Basommatophora 

35 (mostly aquatic snails), Systellommatophora and Stylommatophora (mostly land 
snails). The Basommatophora include the families of the Acroloxidae (e.g. genus 
Acroloxus), Lymnaeidae (e.g. genera Galba, Stagnicola, Radix and Lymnaea), 
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Physidae (e.g. genera Physa and Aplexa) and Planorbidae (e.g. genera Planorbis, 
Anisus, Ancylus, Gyrauius, Biomphalaria and Bulinus). Examples of suitable species 
are Lymnaea stagnalis (pond snail) and Bulinus truncatus. The isolation of the 
AChBPs from these snails, cloning of the cDNA encoding these AChBPs and their 

5 characterization including the full amino acid sequences is described in the 
examples. The cDNA and amino acid sequences of the AChBPs of Lymnaea 
stagnalis are depicted in SEQ ID Nos. 1 and 2 (L-AChBP_T1) and SEQ ID Nos. 3 
and 4 (L-AChBP_T2). Those of Bulinus truncatus are depicted in SEQ ID Nos. 5 and 
6 (B-AChBP_T1) and SEQ ID Nos. 7 and 8 (B-AChBP_T2). Features of these 

10 proteins are further described in the examples and the accompanying figures. 

While a water-soluble ligand-binding protein derived form a Pulmonata species, 
preferably from a Basommatophora species is preferred, it will be appreciated that 
the present invention generally relates to any water-soluble protein being capable of 
15 binding a ligand of a ligand-gated receptor comprising an amino acid sequence 
selected from the group consisting of: 

(a) an amino acid sequence as depicted in any one of SEQ ID Nos. 2, 4, 
6 or 8 or a functional equivalent thereof, or a fragment of at least 5 
continuous amino acids thereof; 
20 (b) an amino acid sequence having at least 30% amino acid identity to the 
amino acid sequence of any one of SEQ ID Nos. 2, 4, 6 or 8; and 
(c) an amino acid sequence resulting in a protein which is detectable by a 
monoclonal or polyclonal antibody which recognises, preferably with a binding 
affinity of at least 10" 7 M, a protein comprising an amino acid sequence of (a) 
25 or (b). 

Identity or similarity, as known in the art, are relationships between two or more 
polypeptide sequences or two or more polynucleotide sequences, as determined by 
comparing the sequences. In the art, identity also means the degree of sequence 

30 relatedness between polypeptide or polynucleotide sequences, as the case may be, 
as determined by the match between strings of such sequences. Both identity and 
similarity can be readily calculated (Computational Molecular Biology, Lesk, ed., 
Oxford University Press, New York, 1988; Biocomputing: Informatics and Genome 
Projects, Smith, ed., Academic Press, New York, 1993; Computer Analysis of 

35 Sequence Data, Part I, Griffin and Griffin, eds., Humana Press, New Jersey, 1994; 
Sequence Analysis in Molecular Biology, von Heinje, Academic Press, 1987; and 
Sequence Analysis Primer, Gribskov and Devereux, eds., M Stockton Press, New 
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York, 1991). While there exist a number of methods to measure identity and similarity 
between two polynucleotide or two polypeptide sequences, both terms are well 
known to skilled artisans (von Heinje, supra; Gribskov and Devereux, supra; and 
Carillo and Lipman SIAM J. Applied Math. 48 (1988), 1073). Methods commonly 

5 employed to determine identity or similarity between sequences include, but are not 
limited to those disclosed in Carillo and Lipman; see supra. Preferred methods to 
determine identity are designed to give the largest match between the sequences 
tested. Methods to determine identity and similarity are codified in computer 
programs. Preferred computer program methods to determine identity and similarity 

10 between two sequences include, but are not limited to, GCG program package 
(Devereux et a!., Nucleic Acids Research 12 (1984), 387), BLASTP, BLASTN, psi 
BLAST and FASTA (Atschul et al M J. Molec. Biol. 21 5 (1990), 403). 



In another embodiment, the present invention relates to a water-soluble protein being 
1 5 capable of binding a ligand of a ligand-gated receptor comprising 

(a) at least the amino acids of the water-soluble protein described above 
determining solubility of said protein, in the same or 
corresponding positions as in said protein; and 

(b) at least 4 amino acids determining binding to said ligand. 

20 

Protein expression studies have shown that wild-type AChBP of the mollusc 
Lymnaea stagnalis can be produced in Pichia pastoris yeast. The yeast cells express 
AChBP in a homopentameric form and secrete the protein complex into the medium. 
The large amounts of AChBP per volume of medium produced (up to 2 mg per liter 

25 medium) and the large volumes of yeast that can be cultured allow a large-scale 
production of AChBP. Besides the wild-type AChBP, various AChBP mutants have 
been produced in Pichia pastoris. These include mutants containing the following 
single point mutations (the numbers refer to the amino acid position in the AChBP 
sequence of Lymnaea stagnalis depicted in SEQ ID No. 2 counted from the first 

30 amino acid of the signal peptide; the letter before the number indicates the original 
amino acid and the letter after the number indicates the mutant amino acid) N85D, 
H164Y, D194N, Y204P, Y211P and D213N. 

Thus the invention pertains to water-soluble proteins derived from molluscan, 
preferably acetylcholine binding proteins (AChBP's), which are capable of forming 
35 multimers, and are capable of binding a ligand of a ligand-gated receptor. These 
proteins comprise, on the one hand, at least of the amino acids of the AChBP 
determining solubility of the AChBP in the same positions as in the AChBP, and, on 
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the other hand, amino acids determining binding to the ligand of the ligand-gated 
receptor. The degree of identity with the molluscan AChBP sequence can be defined 
by amino acid identity, of at least 15%, preferably 20%, more preferably 30%, still 
more preferably 40%, preferably at least 50 or even at least 60%, preferably more 
5 than 70%, more preferably more than 80% and most preferably at least 90% identity, 
or more, as determined, e.g., using the art-known BLAST algorithm. The amino acids 
determining binding to the ligand should comprise at least 4 amino acids, preferably 
at least 6 or even at least 8 amino acids, including a series of at least 3 or 4 amino 
acids, corresponding to the receptor sequence and preferably differing from the 

10 corresponding AChBP amino acids. Preferred embodiments of these proteins are 
further defined below. Usually, the water-soluble ligand-binding protein or domain as 
part of a for example chimeric ligand-gated ion channel will comprise 200-240 amino 
acids. The ligand is preferably acetylcholine, nicotine, lophotoxin, d-tubocurarine, 
carbamylcholine, galanthamine or epibatidine. 

15 Said ligand-gated receptor can be derived from an arthropod (preferably insect), a 
plant (preferably a higher plant, most preferably a seed plant) or a chordate 
(preferably a mammalian, most preferably human), preferably said ligand-gated 
receptor is a nicotinic acetylcholine receptor. 



20 Usually, the said amino acids in the water-soluble ligand-binding proteins of the 
invention, which determine solubility are in the same positions as in the AChBP 
having the amino acid sequence as depicted in any one of SEQ ID Nos. 2, 4, 6 or 8. 
The solubility determining regions are based on solvent accessibility in structure. The 
respective amino acid residues can be chosen for example according to Figure 10 or 

25 11 in which the solvent accessible regions are indicated. Preferably, the water- 
soluble ligand-binding protein of the invention comprises an amino acid sequence 
having at least 40% amino acid identity to the amino acid sequence of the mature 
AChBP comprising the amino acid sequence of any one of SEQ ID Nos. 2, 4, 6 or 8, 
in which the ligand binding amino acids have been replaced with the corresponding 

30 amino acids of a ligand-gated receptor. 

In one embodiment of the protein of the invention said solubility-determining amino 
acids (a) comprise hydrophilic amino acids (Asp, Glu, Arg, Lys) from the sequences 
20-44, 73-81, 86-92, 112-120, 135-152, 166-189, 196-20, 209-213, and/or 219-227 
of SEQ ID No. 2. 

35 The amino acid sequences of L-AChBP_T1 (SEQ ID No. 2) and T2 (SEQ ID No. 4) 
are almost similar. For the sake of clarity, reference is always made to L-AChBP JT1 
(SEQ ID No. 2). However, all references to amino acid residues within are valid for 
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both T1 and T2, with the noticeable exceptions of Arg(167) becoming Gly(167) and 
and Thr(203) becoming lle(203). Furthermore, regarding the amino acid residues 
(domains) from L-AChBP_T1 and the corresponding residues from B-AChBP the 
following list provide those amino acid positions in which L-AChBP and B-AChBP 

5 differ. All amino acid residue numbers below correspond to their position within the 
amino acid sequence of the immature protein (numbering starting at methionine (1). 
One could also start numbering at the start of the amino acid sequence of the mature 
sequence (L(1)DRAD for L-AChBP and Q(1)IRW for B-AChBP). When using this 
second method (1st amino acid of the mature seq. = position 1) simply subtract 19 

10 from the L-AChBP position numbers and 21 from the B-AChBP position numbers, for 
example Asp(36) becomes Asp(17) for L-AChBP) and Asp(15) for B-AChBP. For the 
further embodiments the positions are given for L-AChBP T1 (SEQ ID No. 2) followed 
by an indication of the corresponding amino acid positions in the amino acid 
sequence of L-AChBP_T2 (SEQ ID No. 4) .and B-AChBP_T1 (SEQ ID No. 6) & B- 

15 AChBP_T2 (SEQ ID No. 8) in the form of (L-AChBP_T1&T2 : B-AChBP_T1&T2). 

In a preferred embodiment said solubility determining amino acids (a) comprise 
amino acids Asp(36), Asp(68), Glu(115), Arg(137), Asp(143), Asp(148), Glu(150), 
Arg(167), Arg(189), Glu(215) of SEQ ID No.2, wherein Asp may be exchanged for 
20 Glu and vice versa and Lys may be exchanged for Arg and vice versa (L- 
AChBP_T1&T2 : B-AChBP JT1&T2; Asp(36) : Asp(36); Asp(68): Asp(68); Glu(115): 
Glu(116); Arg(137): Arg(138); Asp(143): Asp(144); Asp(148): Asp(149); Glu(150): 
Glu(151); Arg(167): Gly(167), in L-AChBP JT2 : Lys(168); Arg(189): Lys(190); 
Glu(215):Glu(216). 

25 

In a still more preferred embodiment the water-soluble ligand-binding protein 
comprises the amino acids Cys(142), Thr(149), Ala(153), Thr(154), Cys(155), 
Arg(156), lle(157) and/or Lys(158) of SEQ ID No. 2. (L-AChBP_T1&T2 : B- 
AChBP_T1&T2; Cys(142): Cys(143); Thr(149): Thr(150); Ala(153): Ala(154); 

30 Thr(154); Thr(155); Cys(155): Cys(156); Arg(156): Arg(157); lle(157): lle(158); 
Lys(158): Lys(159). In a further embodiment the water-soluble ligand-binding protein 
comprises either in addition or alternatively the amino acids (b) Pro(39), Trp(77), 
Trp(101), Pro(103), Asp(194), and/or Ser(161) of SEQ ID No. 2 (L-AChBP_T1 &T2 : 
B-AChBP_T1&T2; Pro(39): Pro(39); Trp(77): Trp(77); Trp(101): Trp(102); Pro(103): 

35 Pro(104); Ser(161): Ser(162); Asp(194): Ser(195). 

In a still further embodiment the water-soluble ligand-binding protein comprises either 
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in addition or alternatively to the above described embodiments amino acid 
sequences 1 65-1 69 and/or 200-203 of SEQ ID No. 2 have been exchanged with the 
corresponding sequence of the ligand-gated receptor (L-AChBP_T1&T2 : B- 
AChBP_T1&T2; His(165)-lso(169):. Asp(166)-Phe(170) (B-AChBP_T1) : Asp(166)- 
5 Leu(170) (B-AChBP_T2); Asn(200)-Thr(203); lso(203) for L-AChBP_T2: Asn(201)~ 
Lys(204). 

The amino acids determining binding to the ligand of the nicotinic acetylcholine 
receptor include three stretches on the nAChR alpha subunits. These stretches 
contain amino acids that are conserved throughout the various nAChR alpha 

10 subunits and that are essential for ligand binding. These stretches (corresponding to 
the Torpedo alpha subunlt) are (numbering of nAChR al as depicted in SEQ ID No. 
9): Trp (108) - Tyr (115), Trp (108) and Tyr (115) being essential; Trp (171) - Tyr 
(173), the amino acids Trp (171) and Tyr (173) being essential; Tyr (210) - Tyr (217), 
the amino acids Tyr (210), Cys (212), Cys (213) and Tyr (217) being essential. In the 

15 chimeric proteins according to the invention, at least the essential amino acids of at 
least one of these stretches haven been substituted for the corresponding amino 
acids. Preferably, the entire stretches have been substituted. 



In a particularly preferred embodiment of the invention, the water-soluble ligand- 
20 binding protein is capable of binding a ligand of an acetylcholine receptor, wherein in 
said protein at least one of the amino acid sequences Trp(101) - Tyr(T108), Trp(162) 
- His(164) and Tyr(204) - Tyr(211) of SEQ ID No. 2 has been exchanged with the 
corresponding sequence of the acetylcholine receptor (L-AChBP_T1&T2 : B- 
AChBP_T1&T2; Trp(101)-Tyr(108): Trp(102)-Tyr(109); Trp(162)-His(164): Trp(163)- 
25 His(165), (B-AChBP_T1) : Trp(163)-Phe(165) (B-AChBP_T2); Tyr(204)-Tyr(211): 
Tyr(205)-Tyr(212). 

On the basis of homology to the AChBPs, it is possible to change amino acid 
residues in the original amino acid sequence of the ligand-gated ion channel, which 

30 are not critical to ligand-binding or essential for the tertiary and quaternary structure 
of the receptor but could be substituted to amino acid residues which according to 
the AChBP in particular the crystal structure contributes to their water-solubility. As a 
result the ligand-gated ion channel or its ligand-binding domain or the respective 
monomers and pentamers are for example expected to be more easily expressible in 

35 recombinant expression system and more importantly amenable to crystallization, 
allowing the construction of three-dimensional models of their ligand binding 
domains. 
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Thus, in another embodiment the present invention relates to a method for the 
production ot a water-soluble ligand-gated receptor or a corresponding ligand-binding 
domain or for improving the water solubility and accessibility to crystallization of such 
a receptor or domain, said method comprising altering the amino acid sequence of 
5 the extracellular domain of a ligand-gated receptor by way of substituting, adding, 
deleting or modifying at least one amino acid at a position corresponding to an amino 
acid determining or contributing to the water-solubility of the above-described water- 
soluble ligand-binding protein of the present invention The method of the invention 
can be performed using conventional techniques known in the art, for example, by 

10 using amino acid deletion(s), insertion(s), substitution(s), addition(s), and/or 
recombination(s) and/or any other modification(s) known in the art either alone or in 
combination. Methods for introducing such modifications in the DNA sequence 
underlying the amino acid sequence of the ligand-binding domain a ligand-gated ion 
channel are well known to the person skilled in the art; see, e.g., Sambrook, 

15 Molecular Cloning A Laboratory Manual, Cold Spring Harbor Laboratory (1989) N.Y. 
The resulting ligand-gated receptor or ligand-binding domain retains comparable in 
vitro and preferably also in vivo ligand-binding activity to that of the ligand-gated ion 
channel, and more importantly, allow complete crystallization of the protein such that 
they may be characterized by X-ray crystallography. The X-ray crystallographic data 

20 can be used for example for identification and construction of possible therapeutic 
compounds in the treatment of various disease conditions. 

As has- been discussed herein before, the ligand-gated ion channel superfamily 
including nACh, 5-HT3, glycine, GABA A , and GABA C receptors as well as 
invertebrate glutamate ion-channels and MOD-1 serotonin channel contain 
25 extracellular ligand binding domains that are homologous to the AChBP. Many of 
these receptors are promising drug targets. Therefore, the ligand-gated receptor to 
be modified is preferably one of those of the mentioned superfamily, most preferably 
it is nAChR. 

Information on the nucleotide and amino acid sequences, structural elements, 
30 functional assays of the nAch, 5-HT3, glycine, GABA A , and GABA C receptors can be 

found in the prior art. For example, the nicotinic receptors at the amino acid level are 

described in Corringer et al M Annu. Rev. Pharmacol. Toxicol. 40 (2000), 431-458. 

Means for retrieving nucleotide and amino acid sequences, performing sequence 

alignments in order to identify the most likely critical amino acid residues are 
35 described below and in the examples; for further general information see the review 

on periplasmic binding protein (PBP), an ancient protein module present in multiple 

drug receptors by Felder et al., PharmSci. 1(2) (1999). 
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In a preferred embodiment of the method of the present invention, said at least one 
amino acid is altered to the corresponding amino acid of the amino acid sequence 
depicted in any one of SEQ ID Nos. 2, 4, 6 or 8, or to a an equivalent amino acid, 
5 preferably in which said solubility-determining amino acids comprise solvent 
accessible regions in the crystal structure according to Figure 10 or 11. Preferred 
amino acid sequence positions and amino acid substitutions are described above for 
the AChBP and can be applied generally in the method of the present invention. 
It is expected that the insertion of the loop Cys123-Cys136 of the mature AChBP 

10 SEQ ID No. 2 into the equivalent region (Cys127-Cys141) in the mature nicotinic a7 
homopentamer ligand binding domain creates an easily expressed form of this 
protein. Likewise, this loop or an equivalent loop from other water-soluble ligand 
proteins of the present invention can be inserted into the equivalent region of other 
homopentameric ligand binding domains of ligand gated ion channels such as the 

15 glycine receptor and the 5-HT3 receptor to create an easily expressed form of those 
proteins. 

Thus, in one embodiment, the present invention relates to any one of the above 
described methods, wherein loop Cys123-Cys136 of SEQ ID No. 2 is inserted into 
the corresponding region of the ligand binding domain of the ligand-gated receptor. 

20 

The above described water-soluble ligand-gated receptor or a corresponding ligand- 
binding domain are usually prepared by site-directed mutagenesis of the underlying 
encoding polynucleotide. Once the corresponding polynucleotide has been 
generated it can be used to express the altered ligand-gated receptor or a 
25 corresponding ligand-binding domain. Thus, the method of the present invention 
commonly comprises 

(a) culturing a host cell transfected with and capable of expressing a 

polynucleotide comprising a nucleotide sequence encoding the altered 
amino acid sequence; and optionally 
30 (b) recovering said water-soluble ligand-gated receptor or corresponding 
ligand-binding domain from the culture. 
Methods for the expression and purification of the water-soluble ligand-gated 
receptor or corresponding ligand-binding domain of the present invention are 
described further below. Preferably, the expression system described in Examples 4 
35 and 5, or corresponding expression systems are used. 
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The present invention also relates to the a water-soluble ligand-gated receptor and 
ligand-binding domain obtainable by the above described methods of the invention. 
Preferably, said water-soluble ligand-gated receptor exhibits a 10-fold, more 
preferably 100-fold, still more preferably 1000-fold and most preferably 10000-fold 
5 higher solubility in water than the corresponding wild type, preferably human ligand- 
gated receptor. However, improvements in water solubility of about 2 to 5 fold is also 
already advantageous. The average hydrophobicity may be in the range of -100 to - 
400. Accordingly, the present invention provides methods for the prediction and 
creation of mutants and chimeras of ligand binding domains, of homopentameric 
10 acetylcholine receptor subtypes and of other homopentameric ion channels with 
increased solubility. 

In one embodiment the water-soluble ligand-binding protein of the invention further 
comprises a spacer sequence allowing coupling with a carrier body. The spacer 
sequence may be an amino acid sequence encodable by a polynucleotide or other 
molecule such as polymethylene anchor groups commonly used in chip technology. 
The chimeric protein of the invention may further comprise a spacer sequence, which 
allows coupling of the protein to a carrier body. Such spacer sequence may be e.g. 
an oligo-histidine stretch attached to the C-terminus of the protein. Such an oligo- 
histidine stretch is capable of binding to Talon© metal affinity beads or similar 
carriers. Such binding stretches have no detectable influence on the pharmacological 
properties of the proteins. The chimeric proteins according to the invention can be 
used for screening of specific binding of potential drugs, in particular screening for 
modulators of ion-channel opening. Conventional in vitro screening techniques, such 
as phage display technology, can be used for this purpose. High-throughput assays, 
possibly in combination with combinatorial chemistry can also be used. Specific 
binding of test compounds to the (immobilised) chimeric proteins of the invention can 
be performed e.g. by competition binding assays using alpha bungarotoxin as a 
competitor. The invention also concerns test kits containing the proteins described 
above, together with further means for carrying out a screening test, such as carriers, 
labels, diluents, other chemicals etc. 

In addition, the present invention relates to fusion proteins comprising the water- 
soluble ligand-binding protein of the invention or a binding fragment thereof and a 
35 fragment of a ligand-gated receptor. The term "fusion protein" as used herein refers 
to protein constructs that are the result of combining multiple protein domains or 
linker regions for the purpose of gaining the combined functions of the domains or 
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linker regions. This is may be accomplished by molecular cloning of the nucleotide 
sequences encoding such domains to produce a new polynucleotide sequence that 
encodes the desired fusion protein. Alternatively, creation of a fusion protein may be 
accomplished by chemically joining two proteins. A fusion protein of the present 
5 invention preferably comprises at least the ligand-binding domain of the AChBP or of 
a ligand-gated. ion channel, which has been modified in accordance with the above 
described methods. 

Nicotinic acetylcholine receptors are comprised of five subunits, selected from a 

10 related family of subunit proteins. The neuronal subunits fall into two main types 
depending on the presence or absence of a pair of vicinal cysteines close to the 
binding site for acetylcholine. Thus all a-subunits contain paired cysteine residues 
thought to play a role in binding of nicotinic agonists (Aplin and Wonnacott, 48 (1994), 
473-477), whereas the p-subunits do not. There are ten known alpha subunits, a1 to 

15 a10, and at least four beta subunits, pi to p4. Receptors comprise at least one alpha 
subunit which in some cell types combine with a beta subunit and in some cases a 
gamma, delta and epsilon subunit. For example, the AChR at the neuromuscular 
junction is believed to have an (a1)2piy8 stoichiometry. Within the group of a- 
subunits there is marked diversity in the manner in which a complete functional 

20 nAChR is formed. The majority of the a subunits only form functional receptors when 
combined as a heteropentamer with p-subunits in the CNS (McGehee and Role, 
Annual Review of Physiology 57 (1995), 521-546). However, ct7, <x8 and a9 nAChR 
subunits and the related 5-HT3A subunit are capable of forming functional 
homopentameric receptors. In this respect it is interesting that the phylogenetic 

25 relationship between nAChR subunits suggest that <x7, <x8, <x9 and the related 5-HT3A 
subunit are more related to each other than to the subunits which only form 
heteropentameric receptors. Sequence homologies indicate that the a7, a8 and a9 
subunits form a distinct subgroup of the alpha subunits. 

As is evident form the foregoing, the above described water-soluble ligand-binding 
30 protein or receptor or ligand-binding domain thereof can be used for forming 
complexes of homo- or heteromultimers, such as a dimer, pentamer or decamer 
consisting of at least one monomer of the mentioned proteins of the present 
invention. Preferably, these multimers constitute ai function ligand-gated receptor. 
Preferably, said ligand-gated receptor is related to the nAchR. 

35 

The present invention also relates to the production of synthetic heteropentamers 
resembling heteropentameric gated ion-channels by mutation of AChBP, using 
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knowledge of the crystal structure about the primary and secondary contact regions; 
see infra. Preferably, said synthetic heteropentamers resembles a heteropentameric 
nicotinic acetylcholine receptor. Accordingly, the present invention more generally 
relates to a ligand-gated ion channel comprising any one of the above described 
5 water-soluble ligand-bjnding proteins or receptors of the invention as a monomer, 
homo- or heterodimer or -pentamer. This method therefore allows the prediction and 
creation of mutants and chimeras of nicotinic acetylcholine receptors and other 
ligand-gated ion channels that are insensitive or more sensitive to toxin binding, e.g. 
bungarotoxin, lophotoxin, conotoxin, and other toxins that inhibit ligand-gated ion 
10 channels. Preferably, said ligand-gated ion channel is less or more sensitive to 
binding of toxins such as bungarotoxin, lophotoxin or conotoxin compared to the wild 
type ligand-gated ion channel. 

Further information and examples how to create chimeric ligand-binding proteins in 
accordance with the present invention is given in Example 10. 

15 

The nucleotide and amino acid sequences of the acetylcholine, 5-HT3, glycine, 
GABA A , and GABA C receptors can be easily retrieved from public database, for 
example from the internet using http://www.ncbi.nlm.nih.gov/Entrez. The citations 
also include a reference to the corresponding publication also reporting on the 

20 functional expression of the respective receptor. 

The use of recombinant acetylcholine-gated ion channels and functionally assays in 
the discovery of putative novel ligands has been described in Cosford, Pharm. Acta 
Helv. (2000), 74(2-3), 125-130. Furthermore, the cell-free expression and functional 
reconstitution of homo-oligomeric a7 nicotinic acetylcholine receptors into planar lipid 

25 bllayers has been reported by Lyford and Rosenberg, J. Bioi. Chem. (1999), 
274(36), 25675-25681. The use of functional assays of cloned and native muscarinic 
acetylcholine receptors for determining the selectivity profile of toxins has been 
described by Oiianas et al. (J. Pharmacol. Exp. Ther. 288 (1999), 164-170). A system 
for the evaluation of pharmacological differences and similarities between 5-HT3 

30 receptors stably transfected cells is provided by for example Bruss et al., Naunyn- 
Schmiedebergs Archives of Pharmacology 360 (1999), 225-33. The primary structure 
and functional expression of the 5-HT3 receptor is described in Maricq et al., Science 
254 (1991), 432-437. Likewise/the stable expression of human glycine a1 and a2 
receptor monomers in mouse L(tk-) cells and their use for the study of the physiology 

35 and pharmacology of functional glycine receptors is described in Wick et al., J. 
Neurosci. Methods 87 (1999), 97-103. An example for the measurement of the 
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pharmacology of recombinant GABA A receptor subtypes is described in Simpson et 
al., J. Neurosci. Methods 99 (2000), 91-100. Further examples for assay systems are 
given below. 

The described methods as well as others known to the person skilled in the art can 
5 be used for example to 

(1) express and characterise the water-soluble ligand-binding proteins and ligand- 
gated ion channels of the present invention; and 

(2) use stably transfected cells expressing the above described ligand-gated ion 
channels for the identification of novel ligands. 

10 

The present invention also relates to polynucleotides encoding the water-soluble 
ligand-binding proteins and ligand-gated ion channels of the present invention, and 
multimers thereof, preferably dimers or pentamers. Such polynucleotide may be a 
DNA such as a cDNA, or an RNA such as mRNA or any other form of nucleic acid 

15 including synthetic or modified derivatives and may encode the polypeptide in a 
continuous sequence or in a number of sequences interrupted by intervening 
sequences. In which ever form it is present, the polynucleotide is an isolated 
polynucleotide in that it is removed from its naturally-occurring state. This aspect of 
the invention is based on the cloning of the cDNA for ligand-binding proteins. In a 

20 preferred embodiment, the polynucleotide comprises the nucleotide sequence of any 
one of SEQ ID Nos. 1, 3, 5 or 7, optionally including one or more mutations or 
deletions which do not substantially affect the activity of the polypeptide encoded 
thereby. Such mutations include those arising from the degeneracy of the genetic 
code, as well as those giving rise to any of the amino acid mutations or deletions 

25 discussed above. The polynucleotides of the invention preferably comprise 

(a) a nucleotide sequence having at least 1 5 continuous nucleotides of 
the nucleotide sequence depicted in any one of SEQ ID Nos. 1 , 3, 5 or 
7 or a degenerated nucleotide sequence thereof; or 

(b) a nucleotide sequence capable of hybridizing to a nucleotide 
30 sequence of (a) under stringent hybridisation conditions. 

Typically, selective hybridization will occur when there is at least about 55% 
sequence identity - preferably at least about 65%, more preferably at least about 
75%, and most preferably at least about 90% - over a stretch of at least about 14 
35 nucleotides; see, e.g., Kanehisa, Nucleic Acids Res. 12 (1984), 203-213, herein 
incorporated by reference. Nucleic acid hybridization will be affected by such 
conditions as salt concentration, temperature, solvents, the base composition of the 



Page 18 of 259 



WO 01/58951 PCT/EP01/01457 

-17 

hybridizing species, length of the complementary regions, and the number of 
nucleotide base mismatches between the hybridizing nucleic acids, as will be readily 
appreciated by those skilled in the art. 

"Stringent hybridization conditions" and "stringent wash conditions" in the context of 
5 nucleic acid hybridization experiments depend upon a number of different physical 
parameters. The most important parameters include temperature of hybridization, 
base composition of the nucleic acids, salt concentration and length of the nucleic 
acid. One having ordinary skill in the art knows how to vary these parameters to 
achieve a particular stringency of hybridization. In general, "stringent hybridization" 
10 is performed at about 25°C below the thermal melting point (T m) for the specific 
DNA hybrid under a particular set of conditions. 

"Stringent washing" is performed at temperatures about 5°C lower than the Tm for 
the specific DNA hybrid under a particular set of conditions. The Tm is the 
temperature at which 50% of the target sequence hybridizes to a perfectly matched 

15 probe; see Sambrook et al., page 9.51, hereby incorporated by reference. The Tm 
for a particular DNA-DNA hybrid can be estimated by the formula: 
Tm = 81.5°C + 16.6 (log10[Na+ ]) + 0.41 (fraction G + C) - 0.63 (% formamide) - 
(600/I) where I is the length of the hybrid in base pairs. 
The Tm for a particular RNA-RNA hybrid can be estimated by the formula: 

20 Tm = 79.8°C + 18.5 (log10[Na+ ]) + 0.58 (fraction G + C) + 11.8 (fraction G + C)2 - 
0.35 (% formamide) - (820/I). 

The Tm for a particular RNA-DNA hybrid can be estimated by the formula: 

Tm = 79.8°C + 18.5(log10[Na+ ]) + 0.58 (fraction G + C) + 1 1 .8 (fraction G + C)2 - 

0.50 (% formamide) - (820/I). 

25 In general, the Tm decreases by 1-1 .5°C for each 1% of mismatch between two 
nucleic acid sequences. Thus, one having ordinary skill in the art can alter 
hybridization and/or washing conditions to obtain sequences that have higher or 
lower degrees of sequence identity to the target nucleic acid. For instance, to obtain 
hybridizing nucleic acids that contain up to 10% mismatch from the target nucleic 

30 acid sequence, 10-15°C would be subtracted from the calculated Tm of a perfectly 
matched hybrid, and then the hybridization and washing temperatures adjusted 
accordingly. Probe sequences may also hybridize specifically to duplex DNA under 
certain conditions to form triplex or other higher order DNA complexes. The 
preparation of such probes and suitable hybridization conditions are well known in 

35 the art. An example of stringent hybridization conditions for hybridization of 
complementary nucleic acid sequences having more than 100 complementary 
residues on a filter in a Southern or Northern blot or for screening a library is 50% 
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formamide/6X SSC at 42°C for at least ten hours. Another example of stringent 
hybridization conditions is 6X SSC at 68°C for at least ten hours. An example of low 
stringency hybridization conditions for hybridization of complementary nucleic acid 
sequences having more than 100 complementary residues on a filter in a Southern or 

5 northern blot or for screening a library is 6X SSC at 42°C for at least ten hours. 
Hybridization conditions to identify nucleic acid sequences that are similar but not 
identical can be identified by experimentally changing the hybridization temperature 
from 68°C to 42°C while keeping the salt concentration constant (6X SSC), or 
keeping the hybridization temperature and salt concentration constant (e.g. 42°C and 

10 6X SSC) and varying the formamide concentration from 50% to 0%. Hybridization 
buffers may also include blocking agents to lower background. These agents are 
well-known in the art; see Sambrook et al., pages 8.46 and 9.46-9.58, herein 
incorporated by reference. Wash conditions also can be altered to change stringency 
conditions. An example of stringent wash conditions is a 0.2x SSC wash at,65°C for 

15 15 minutes (see Sambrook et al., for SSC buffer). Often the high stringency wash is 
preceded by a low stringency wash to remove excess probe. An exemplary medium 
stringency wash for duplex DNA of more than 100 base pairs is 1x SSC at 45°C for 
15 minutes. An exemplary low stringency wash for such a duplex is 4x SSC at 40°C 
for 15 minutes. In general, signal-to-noise ratio of 2x or higher than that observed for 

20 an unrelated probe in the particular hybridization assay indicates detection of a 
specific hybridization. 

By the provision of the nucleotide sequences of SEQ ID JMos. 1 , 3, 5 and 7 as well as 
those encoding the amino acid sequences depicted in SEQ ID Nos. 2, 4, 6 and 8 it is 
possible to isolate identical or similar nucleic acid molecules which encode water- 

25 soluble ligand-binding proteins from other species or organisms, in particular 
orthologous water-soluble ligand-binding protein encoding genes from mammals. 
The term "orthologous" as used herein means homologous sequences in different 
species that arose from a common ancestor gene during speciation. Orthologous 
genes may or may not be responsible for a similar function; see, e.g., the glossary of 

30 the "Trends Guide to Bioinformatics", Trends Supplement 1998, Elsevier Science. 

In a further aspect, the present invention provides a recombinant polynucleotide 
comprising a vector incorporating the polynucleotide of the present invention. Many 
suitable vectors are known to those skilled in molecular biology, the choice of which 
35 would depend on the function desired and include plasmids, cosmids, viruses, 
bacteriophages and other vectors used conventionally in genetic engineering. 
Methods which are well known to those skilled in the art can be used to construct 



WO 01/58951 



Page 20 of 259 



WO 01/58951 PCT/EP01/01457 

-19 

various plasmids and vectors; see, for example, the techniques described in 
Sambrook, Molecular Cloning A Laboratory Manual, Cold Spring Harbor Laboratory 
(1989) N.Y. and Ausubel, Current Protocols in Molecular Biology, Green Publishing 
Associates and Wiley Interscience, N.Y. (1989), (1994). Alternatively, the 

5 polynucleotides and vectors of the invention can be reconstituted into liposomes for 
delivery to target cells. As discussed in further details below, a cloning vector was 
used to isolate individual sequences of DNA. Relevant sequences can be transferred 
into expression vectors where expression of a particular polypeptide is required. 
Typical cloning vectors include pBscpt sk, pGEM, pUC9, pBR322 and pGBT9. 

10 Typical expression vectors include pTRE, pCAL-n-EK, pESP-1, pOP13CAT, pET, 
pGEX, pMALC, pPIC9, pBac. 

Hence, in a preferred embodiment of the present invention the above-described 
polynucleotides either alone or present in a vector are linked to control sequences 
15 which allow the expression of the polynucleotide In prokaryotic and/or eukaryotic 
cells. 

The term "control sequence" refers to regulatory DNA sequences which are 
necessary to effect the expression of coding sequences to which they are ligated. 
The nature of such control sequences differs depending upon the host organism. In 

20 prokaryotes, control sequences generally include promotor, ribosomal binding site, 
and terminators. In eukaryotes generally control sequences include promotors, 
terminators and, in some instances, enhancers, transactivators or transcription 
factors. The term "control sequence" is intended to include, at a minimum, all 
components the presence of which are necessary for expression, and may also 

25 include additional advantageous components. 

The term "operably linked" refers to a juxtaposition wherein the components so 
described are in a relationship permitting them to function in their intended manner. A 
control sequence "operably linked" to a coding sequence is ligated in such a way that 
expression of the coding sequence is achieved under conditions compatible with the 

30 control sequences. In case the control sequence is a promotor, it is obvious for a 
skilled person that double-stranded nucleic acid is preferably used. 
Thus, the vector of the invention is preferably an expression vector. An "expression 
vector" is a construct that can be used to transform a selected host cell and provides 
for expression of a coding sequence in the selected host. Expression vectors can for 

35 instance be cloning vectors, binary vectors or integrating vectors. Expression 
comprises transcription of the nucleic acid molecule preferably into a translatable 
mRNA. Regulatory elements ensuring expression in prokaryotic and/or eukaryotic 
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cells are well known to those skilled in the art. In the case of eukaryotic cells they 
comprise normally promoters ensuring initiation of transcription and optionally poly-A 
signals ensuring termination of transcription and stabilization of the transcript. 
Possible regulatory elements permitting expression in prokaryotic host cells 
5 comprise, e.g., the PL, lac, trp, T7 or tac promotor in E. coli, and examples of 
regulatory elements permitting expression in eukaryotic host cells are the AOX1 or 
GAL1 promotor in yeast or the CMV-, SV40-, RSV-promotor (Rous sarcoma virus), 
CMV-enhancer, SV40-enhancer or a globin intron in mammalian and other animal 
cells. In this context, suitable expression vectors are known in the art such as 

10 Okayama-Berg cDNA expression vector pcDV1 (Pharmacia), pCDM8, pRc/CMV, 
pcDNAI , pcDNA3 (In-vitrogene), pSPORTI (GIBCO BRL). An alternative expression 
system which could be used to express the protein is an insect system. In one such 
system, Autographa caiifornica nuclear polyhedrosis virus (AcNPV) is used as a 
vector to express foreign genes in Spodoptera frugiperda cells or in Trichoplusia 

15 larvae. The coding sequence of a nucleic acid molecule of the invention may be 
cloned into a nonessential region of the virus, such as the polyhedrin gene, and 
placed under control of the polyhedrin promotor. Successful insertion of said coding 
sequence will render the polyhedrin gene inactive and produce recombinant virus 
lacking coat protein coat. The recombinant viruses are then used to infect S. 

20 frugiperda cells or Trichoplusia larvae in which the protein of the invention is 
expressed (Smith, J. Virol. 46 (1983), 584; Engelhard, Proc. Nat. Acad. Sci. USA 91 
(1994), 3224-3227). 

In plants, promoters commonly used are the polyubiquitin promotor, and the actin 
promoter for ubiquitous expression. The termination signals usually employed are 

25 from the Nopaline Synthase promotor or from the CAMV 35S promotor. A plant 
translational enhancer often used is the TMV omega sequences, the inclusion of an 
intron (Intron-1 from the Shrunken gene of maize, for example) has been shown to 
increase expression levels by up to 100-fold, (Mait, Transgenic Research 6 (1997), 
143-156; Ni, Plant Journal 7 (1995), 661-676). Additional regulatory elements may 

30 include transcriptional as well as translational enhancers. Advantageously, the 
above-described vectors of the invention comprises a selectable and/or scorable 
marker. Selectable marker genes useful for the selection of transformed cells and, 
e.g., plant tissue and plants are well known to those skilled in the art and comprise, 
for example, antimetabolite resistance as the basis of selection for dhfr, which 

35 confers resistance to methotrexate (Reiss, Plant Physiol, (Life Sci. Adv.) 13 (1994), 
143-149); npt, which confers resistance to the aminoglycosides neomycin, 
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kanamycin and paromycin (Herrera-Estrella, EMBO J. 2 (1983), 987-995) and hygro, 
which confers resistance to hygromycin (Marsh, Gene 32 (1984), 481-485). 
Useful scorable markers are also known to those skilled in the art and are 
commercially available. Advantageously, said marker is a gene encoding luciferase 
(Giacomin, PI. Sci. 116 (1996), 59-72; Scikantha, J. Bact. 178 (1996), 121), green 
fluorescent protein (Gerdes, FEBS Lett. 389 (1996), 44-47) or 3-glucuronidase 
(Jefferson, EMBO J. 6 (1987), 3901-3907). This embodiment is particularly useful for 
simple and rapid screening of cells, tissues and organisms containing a vector of the 
invention. 

The proteins can be recovered and purified from recombinant cell cultures by well- 
known methods including ammonium sulfate or ethanol precipitation, acid extraction, 
anion or cation exchange chromatography, phosphocellulose chromatography, 
hydrophobic interaction chromatography, size exclusion chromatography, affinity 
chromatography, hydroxylapatite chromatography and lectin chromatography. Most 
preferably, high performance liquid chromatography ( n HPLC 11 ) or FPLC is employed 
for purification. 

The present invention furthermore relates to host cells produced by introducing a 
nucleic acid molecule into the host cell which upon its presence in the cell mediates 
the expression of a gene encoding water-soluble ligand-binding proteins or 
comprising a polynucleotide or a vector as described above or a polynucleotide 
according to the invention wherein the polynucleotides and/or nucleic acid molecule 
is foreign to the host cell. By "foreign" it is meant that the polynucleotide or nucleic 
acid molecule is either heterologous with respect to the host cell, this means derived 
from a cell or organism with a different genomic background, or is homologous with 
respect to the host cell but located in a different genomic environment than the 
naturally occurring counterpart of said nucleic acid molecule. This means that, if the 
nucleic acid molecule is homologous with respect to the host cell, it is not located in 
its natural location in the genome of said host cell, in particular it is surrounded by 
different genes. In this case the polynucleotide may be either under the control of its 
own promotor or under the control of a heterologous promotor. The vector or nucleic 
acid molecule according to the invention which is present in the host cell may either 
be integrated into the genome of the host cell or it may be maintained in some form 
extrachromosomally. In this respect, it is also to be understood that the nucleic acid 
molecule of the invention can be used to restore or create a mutant gene via 
homologous recombination. 
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The host cell can be any prokaryotic or eukaryotic cell, such as bacterial, insect, 
fungal, plant or animal cells. 

The term "prokaryotic" is meant to include all bacteria which can be transformed or 
transfected with a DNA or RNA molecules for the expression of a protein of the 

5 invention. Prokaryotic hosts may include gram negative as well as gram positive 
bacteria such as, for example, E. coli, S. typhimurium, Serratia marcescens and 
Bacillus subtilis. The term "eukaryotic" is meant to include yeast, higher plant, insect 
and preferably mammalian cells. Depending upon the host employed in a 
recombinant production procedure, the protein encoded by the polynucleotide of the 

10 present invention may be glycosylated or may be non-glyc9sylated. The water- 
soluble ligand-binding protein of the invention may or may not also include an initial 
methionine amino acid residue. A polynucleotide of the invention can be used to 
transform or transfect the host using any of the techniques commonly known to those 
of ordinary skill in the art. Furthermore, methods for preparing fused, operably linked 

15 genes and expressing them in, e.g., mammalian cells and bacteria are well-known in 
the art (Sambrook, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 
Laboratory, Cold Spring Harbor, NY, 1989). 

Thus the present invention provides a cell capable of expressing a polypeptide as 
discussed herein. The cell comprises a recombinant host cell usually incorporating 
20 the polynucleotide. Preferably, the host cell incorporates the polynucleotide as the 
recombinant polynucleotide. Any suitable host cell may be chosen, again depending 
on the intended purpose. Suitable host cells include XLI-BLUE, B21(DE3)pLysS, 
HB1 01 , SOLR and SP-Q01 (Saccharomyces pombe). 

Using an appropriate combination of host cell, vector and polynucleotide, an 
25 expression system can be provided so as to obtain a polypeptide useful in the 
presen t invention. This may comprise a fusion polypeptide encoded by the 
recombinant polynucleotide, a part of which is encoded by the vector. Typically, the 
vector will have a promotor region, which is usually inducible, leading to 5' coding 
region associated with the promotor. By appropriate manipulation, the polynucleotide 
30 encoding the polypeptide can be attached to the 5' coding region in frame. In this 
way, expression of the nucleotide sequence downstream of the promotor region 
gives rise to the fusion polypeptide which includes the polypeptide of the present 
invention. 

35 The present invention also relates to an antigen comprising an epitope of at least 5 
continuous amino acids of the amino acid sequence depicted in any one of SEQ ID 
Nos. 2, 4, 6 or 8 and/or said epitope is detectable by a monoclonal or polyclonal 
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antibody which recognises, preferably with a. binding affinity of at least 10* 7 M, a 
protein of the invention as described above. In the present invention, "epitopes" 
refers to fragments of the AChBP of the invention having antigenic or immunogenic 
activity in an animal. A preferred embodiment of the present invention relates to 
5 antigens comprising an epitope, as well as the polynucleotide encoding this 
fragment. A region of a protein molecule to which an antibody can bind is defined as 
an "antigenic epitope." In contrast, an "immunogenic epitope" is defined as a part of a 
protein that elicits an antibody response; see, for instance, Geysen, Proc. Natl. Acad. 
Sci. USA 81 (1983); 3998-4002. Fragments which function as epitopes may be 

1 0 produced by any conventional means; see, e.g., Houghten, Proc. Natl. Acad. Sci. 
USA 82 (1985), 5131-5135 further described in U.S. Patent No. 4,631,211. In the 
present invention, antigenic epitopes preferably contain a sequence of at least five, 
six, seven, more preferably at least nine, and most preferably between about 1 5 to 
about 30 amino acids. Antigenic epitopes are useful to raise antibodies, including 

15 monoclonal antibodies, that specifically bind the epitope; see, for instance, Wilson, 
Cell 37 (1984), 767-778; Sutcliffe, Science 219 (1983), 660-666). Similarly, 
immunogenic epitopes can be used to induce antibodies according to methods well 
known in the art; see, for instance, Sutcliffe, supra; Wilson, supra; Chow, Proc. Natl. 
Acad. Sci. USA 82 (1985), 910-914; and Bittle, J. Gen. Virol. 66 (1985); 2347-2354. 

20 A preferred immunogenic epitope includes the soluble protein. The immunogenic 
epitopes may be presented together with a carrier protein, such as an albumin, to an 
animal system (such as rabbit or mouse) or, if it is long enough (at least about 25 
amino acids), without a carrier. However, immunogenic epitopes comprising as few 
as 8 to 10 amino acids have been shown to be sufficient to raise antibodies capable 

25 of binding to, at the very least, linear epitopes in a denatured polypeptide (e.g., in 
Western blotting.) 

The present invention also relates to antibodies specifically recognizing the water- 
soluble ligand-binding protein and ligand-gated ion channels of the present invention, 

30 in particular recognizing the above described antigen or epitope. As used herein, the 
term "antibody" (Ab) or "monoclonal antibody" (Mab) is meant to include intact 
molecules as well as antibody fragments (such as, for example, Fab and F(ab') 2 
fragments) which are capable of specifically binding to protein. Fab and F(ab') 2 
fragments lack the Fc fragment of intact antibody, clear more rapidly from the 

35 circulation, and may have less non-specific tissue binding than an intact antibody; 
see, e.g., Wahl, J. Nucl. Med. 24 (1983), 316-325. Thus, these fragments are 
preferred, as well as the products of a FAB or other immunoglobulin expression 
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library. Moreover, antibodies of the present invention include chimeric, single chain, 
and humanized antibodies; see also infra. Said antibody can be a monoclonal 
antibody, a polyclonal antibody, a single chain antibody, human or humanized 
antibody, primatized, chimerized or fragment thereof that specifically binds said 
peptide or polypeptide also including bispecific antibody, synthetic antibody, antibody 
fragment, such as Fab, Fv or scFv fragments etc., or a chemically modified derivative 
of any of these. The general methodology for producing antibodies is well-known and 
has been described in, for example, Kohler and Milstein, Nature 256 (1975), 494 and 
reviewed in J.G.R. Hurrel, ed., "Monoclonal Hybridoma Antibodies: Techniques and 
Applications", CFtC Press Inc., Boco Raron, FL (1982), as well as that taught by L. T. 
Mimms et al., Virology 176 (1990), 604-619. Furthermore, antibodies or fragments 
thereof to the aforementioned peptides can be obtained by using methods which are 
described, e.g., in Harlow and Lane "Antibodies, A Laboratory Manual", CSH Press, 
Cold Spring Harbor, 1988. For the production of antibodies in experimental animals, 
various hosts including goats, rabbits, rats, mice, and others, may be immunized by 
injection with polypeptides of the present invention or any fragment or oligopeptide or 
derivative thereof which has immunogenic properties. Techniques for producing and 
processing polyclonal antibodies are known in the art and are described in, among 
others, Mayer and Walker, eds., "Immunochemical Methods in Cell and Molecular 
Biology", Academic Press, London (1987). Polyclonal antibodies also may be 
obtained from an animal, preferably a mammal, previously infected with the virus of 
the invention. Methods for purifying antibodies are known in the art and comprise, for 
example, immunoaffinity chromatography. Depending on the host species, various 
adjuvants or immunological carriers may be used to increase immunological 
responses. Such adjuvants include, but are not limited to, Freund's, complete or 
incomplete adjuvants, mineral gels such as aluminium hydroxide, and surface active 
substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions 
and dinitrophenol. An example of a carrier, to which, for instance, a peptide of the 
invention may be coupled, is keyhole limpet hemocyanin (KLH). When derivatives of 
said antibodies are obtained by the phage display technique, surface plasmon 
resonance as employed in the BIAcore system can be used to increase the efficiency 
of phage antibodies which bind to an- epitope of the peptide or polypeptide of the 
invention (Schier, Human Antibodies Hybridomas 7 (1996), 97-105; Malmborg, J. 
Immunol. Methods 183 (1995), 7-13).' In many cases, the binding phenomena of 
antibodies to antigens is equivalent to other ligand/anti-ligand binding. 
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In another embodiment the present invention relates to an oligonucleotide probe 
comprising a nucleotide sequence having at least 15 continuous nucleotides of a 
polynucleotide of the invention and/or encoding the above described antigen. Such 
oligonucleotides will usually specifically hybridize to a polynucleotide encoding a 

5 water-soluble ligand-binding protein of the invention. Specific hybridization occurs 
preferably under stringent conditions and implies no or very little cross-hybridization 
with nucleotide sequences encoding no or substantially different proteins. Such 
nucleic acid molecules may be used as probes and/or for the control of gene 
expression. Nucleic acid probe technology is well known to those skilled in the art 

1 0 who will readily appreciate that such probes may vary in length. Preferred are nucleic 
acid probes of 1 7 to 35 nucleotides in length. Of course, it may also be appropriate to 
use nucleic acids of up to 100 and more nucleotides in length. The nucleic acid 
probes of the invention are useful for various applications. On the one hand, they 
may be used as PCR primers for amplification of polynucleotides according to the 

15 invention. Another application is the use as a hybridization probe to identify 
polynucleotides hybridizing to the polynucleotides of the invention by homology 
screening of. genomic DNA libraries. Nucleic acid molecules according to this 
preferred embodiment of the invention which are complementary to a polynucleotide 
as described above may also be used for repression of expression of a gene 

20 comprising such a polynucleotide, for example due to an antisense or triple helix 
effect or for the construction of appropriate ribozymes (see, e.g., EP-B1 0 291 533, 
EP-A1 0 321 201, EP-A2 0 360 257) which specifically cleave the (pre)-mRNA of a 
gene comprising a polynucleotide of the invention. Selection of appropriate target 
sites and corresponding ribozymes can be done as described for example in 

25 Steinecke, Ribozymes, Methods in Cell Biology 50, Galbraith et al. eds Academic 
Press, Inc. (1995), 449-460, Standard methods relating to antisense technology have 
also been described (Melani, Cancer Res. 51 (1991), 2897-2901). Said nucleic acid 
molecules may be chemically synthesized or transcribed by an appropriate vector 
containing a chimeric gene which allows for the transcription of said nucleic acid 

30 molecule in the cell. Such nucleic acid molecules may further contain ribozyme 
sequences as described above. 

In this respect, it is also to be understood that the polynucleotide of the invention can 
be used for "gene targeting" and/or "gene replacement", for restoring a mutant gene 
or for creating a mutant gene via homologous recombination; see for example 
35 Mouellic, Proc. Natl. Acad. Sci. USA, 87 (1990), 4712-4716; Joyner, Gene Targeting, 
A Practical Approach, Oxford University Press. 



WO 01/58951 PCT/EP01/01457 

-26 

Furthermore, the person skilled in the art is well aware that it is also possible to label 
such a nucleic acid probe with an appropriate marker for specific applications, such 
as for the detection of the presence of a polynucleotide of the invention in a sample 
derived from an organism, in particular mammals, preferably human. A number of 
5 companies such as Pharmacia Biotech (Piscataway NJ), Promega (Madison Wl), 
and US Biochemical Corp (Cleveland OH) supply commercial kits and protocols for 
these procedures. Suitable reporter molecules or labels include those radionuclides, 
enzymes, fluorescent, chemiluminescent, or chromogenic agents as well as 
substrates, cofactors, inhibitors, magnetic particles and the like. Patents teaching the 
10 use of such labels include US Patents US-A-3,81 7,837; US-A-3,850,752; US-A- 
3,939,350; US-A-3,996,345; US-A-4,227,437; US-A-4,275,149 and US-A-4,366,241 . 
Also, recombinant immunoglobulins may be produced as shown in US-A-4,81 6,567 
incorporated herein by reference. 

Furthermore, the so-called "peptide nucleic acid" (PNA) technique can be used for 
15 the detection or inhibition of the expression of a polynucleotide of the invention. For 
example, the binding of PNAs to complementary as well as various single stranded 
RNA and DNA nucleic acid molecules can be systematically investigated using 
thermal denaturation and BIAcore surface-interaction techniques (Jensen, 
Biochemistry 36 (1997), 5072-5077). 

20 

The present invention also relates to a method for the production of a transgenic non- 
human animal, preferably transgenic mouse, comprising introduction of a 
polynucleotide or vector of the invention into a germ cell, an embryonic cell, stem cell 
or an egg or a cell derived therefrom. The non-human animal can be used in 

25 accordance with a screening method of the invention described herein. Production of 
transgenic embryos and screening of those can be performed, e.g., as described by 
A. L Joyner Ed., Gene Targeting, A Practical Approach (1993), Oxford. University 
Press. The DNA of the embryonal membranes of embryos can be analyzed using, 
e.g., Southern blots with an appropriate probe; see supra. The invention also relates 

30 to transgenic non-human animals such as transgenic mouse, rats, hamsters, dogs, 
monkeys, rabbits, pigs, C. elegans and fish such as Torpedo fish comprising a 
polynucleotide or vector of the invention or obtained by the method described above, 
preferably wherein said polynucleotide or vector is stably integrated into the genome 
of said non-human animal, preferably such that the presence of said polynucleotide 

35 or vector leads to the expression of the water-soluble protein of the present invention. 
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The present invention further relates to composition comprising any one of the above 
described water-soluble ligand-binding proteins, multimers such as dimers or 
pentamers thereof, ligand-gated ion channels, polynucleotides, vectors, host cells, 
antigens, antibodies, or oligonucleotide probes of the invention; and optionally 
5 suitable means for detection or performing a ligand-receptor binding assay. In this 
context, the present invention also relates to a method for identifying an 
agonist/activator or antagonist/inhibitor of a ligand-gated receptor comprising the 
steps of: 

(a) contacting the water-soluble ligand-binding protein of the present invention, 
10 multimers such as dimers or pentamers thereof, or the ligand-gated ion 

channel of the invention or a cell expressing said protein in the presence of 
components capable of providing a detectable signal in response to ligand 
binding with a compound to be screened under conditions that permit binding 
of said compound to the ligand-binding protein; and 
15 (b) detecting the presence or absence of a signal generated from the binding 
activity of the ligand-binding protein, wherein the presence/increase and 
absence/decrease of the signal is indicative for an agonist/activator and 
antagonist/inhibitor, respectively, of a ligand-gated receptor. 



20 Since ligand-gated receptors are modulated allosterically by natural polyamines, 
such as spermine, and by polyamine derivatives, such as polyamine amides (e.g. 
philanthotoxin-343) and polymethylene tetraamines (e.g. methoctramine) 
(Usherwood, Farmaco. 55 (2000), 202-205) compounds comprising or based on 
such entities may be used as starting material for screening. An antagonist or agonist 

25 that "modulates the activity" of a polypeptide and causes an altered signal, for 
example response in the cell refers to a compound that alters the activity of the 
protein so that it behaves differently in the presence of the compound than in the 
absence of the compound. Typically, the effect of an antagonist is observed as a 
blocking of agonist-induced receptor activation. Antagonists include competitive as 

30 well as non-competitive antagonists. A competitive antagonist (or competitive 
blocker) interacts with or near the site specific for agonist binding. A non-competitive 
antagonist or blocker inactivates the function of the receptor by interacting with a site 
other than the agonist interaction site. As understood by those of skill in the art, 
bioassay methods for identifying compounds that modulate the activity of receptors 

35 such as proteins of the invention generally require comparison to a control. One type 
of "control" is a cell or culture that is treated substantially the same as the test cell or 
test culture exposed to the compound, with the distinction that the "control" cell or 
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culture is not exposed to the compound. For example, in methods that use voltage 
clamp electrophysiological procedures, the same cell can be tested in the presence 
or absence of compound, by merely changing the external solution bathing the cell. 
Accordingly, the response of the transfected cell to the "control" cell or culture to the 
5 same compound under the same reaction conditions. However, "control data" can 
also be used from the literature. 

As described in Example 6 the 3-dimensional structure of AChBP could be solved by 
X-ray crystallography at 2.7A resolution (current Rfactor = 27.9 %, Rfree = 30.0 %). 
In crystals, as in solution, AChBP forms a stable homo-pentamer with dimensions 
comparable to those of the ligand-binding domain of ligand-gated ion channels, in 
particular comparable to the nAChR, as determined in EM studies by Unwin and 
coworkers; see supra. The structural analysis revealed that in the AChBP 
homopentamer the monomers have immunoglobulin-like topology. At each of five 
subunit interfaces a ligand-binding site is located, with all residues consistent with 
biochemical data. In this site a buffer molecule (HERPES) stacks with cation-71 
interactions on a tryptophan, resembling acetylcholine binding. The AChBP structure 
is relevant for the development of drugs against, e.g., Alzheimer's disease and 
nicotine addiction. The high-resolution crystal structure of AChBP, along with 
biochemical and pharmacological data, supports the teaching of the present 
invention that the water-soluble ligand-binding proteins of the invention such as 
AChBP are good mimics of ligand-binding domains of ligand-gated ion channels. 

Thus, the present invention relates to a crystal of a water-soluble ligand-binding 
25 protein of the invention, preferably in a multimeric form such as dimer, pentamer or 
decamer. In one embodiment said crystal comprises a protein-ligand complex. 
Methods how to employ and analyze such crystals are known to the person skilled in 
the art; see for example US-A-5,872,01 1 which describes the crystal structure of a 
protein-ligand complex containing an N-terminal truncated elF4E and uses thereof. 
30 The crystal structure of the ligand-gated receptor ligand-binding region in a complex 
with a ligand, preferably being an antagonist or agonist will reveal the determinants of 
receptor-antagonist/agonist interactions and how ligand-binding specificity and 
affinity are altered by remote residues and the redox state of the conserved 
disulphide bond. The structure may also indicate mechanisms for aliosteric effector 
35 action and for ligand-induced channel gating. How the information on the crystal 
structure of a ligand-binding region in a complex with a ligand can be used for the 
development of agonists and antagonists has been described for the structure of a 
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glutamate-receptor ligand-binding core in complex with kainate (Armstrong et al., 
Nature 395 (1998), 913-917). 

The crystal of the invention, in particular when comprising nAChR related proteins 
5 can be a complex of the protein with a ligand comprising an N-alkyiated hydroxyalkyl 
and/or a quaternary ammonium ion. However, other ligands my be used as well. 
Preferred ligands comprise 4-(2-Hydroxyethyl)-1-piperazineethanesulfonic acid 
(HEPES), B-bippinatin, lophotoxin, d-tubocurarine, nicotine, acetylcholine, conotoxin, 
carbamylcholine, gatanthamine, epibatidine or alpha-bungarotoxin or derivatives 
10 thereof. 



Different aspects of X-ray crystallography are such as data collection, structure 
solution, determining the molecular structure from X-ray diffraction, refinement, etc. 
are described in the prior art, see, e.g., Powell, Annu. Rep. Prog. Chem., Sect. C: 
15 Phys. Chem. 96 (2000), 139-175 and Methods in Enzymology, 276-277, edited by 
Carter and Sweet, Academic Press, 1997. Current methods and optimization 
algorithms for the refinement of X-ray crystal structures are described by Van Der 
Maelen Uria, Crystallogr. Rev. 7 (1999), 125-180. 

The crystal of the invention effectively diffracts X-rays for the determination of the 
20 atomic coordinates of the protein or protein-ligand complex to a resolution of greater 
than 5.0, preferably greater than 4.0 Angstroms. In a preferred embodiment the 
crystal effectively diffracts X-rays for the determination of the atomic coordinates of 
the protein-ligand complex to a resolution of greater than 3.0 Angstroms, in a more 
preferred embodiment the crystal effectively diffracts X-rays for the determination of 
25 the atomic coordinates of the protein-ligand complex to a resolution of greater than 
2.0 Angstroms. In one embodiment the crystal effectively diffracts X-rays for the 
determination of the atomic coordinates of the protein-ligand complex to a resolution 
of about 2.7 Angstroms. 

30 Preferably, the crystal of the invention is formed by a protein that has an amino acid 
sequence of amino acids 20 to 223 of SEQ ID No. 2, or an amino acid sequence that 
differs from amino acid 20 to 223 of SEQ ID No. 2 by only having conservative 
substitutions. As is described in the examples, the crystals of the AChBP comprise 
decameric forms of the protein. In order to ease the use of the AChBP protein for 

35 analysis and crystallography it is envisaged to create a mutation in residue Asp2 and 
Asp5 of the mature AChBP SEQ ID No. 2 or 4 to remove the calcium binding site, 
and prevent creation of a decamer. This deletion can be done for example by 
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oligonucleotide-directed mutagenesis. Alternatively crystals could be grown in a low 
calcium concentration or in the absence of calcium. 

The crystal of the present invention preferably has (1) a space group of P2\2fo and 
a unit cell of dimensions of a=120.6A, b=137.oA and c=161 .5A; (2) a space group of 
P4 2 2 1 2 and a unit cell of dimensions of a=b=141.6A and c=120.8A or (3) a space 
group of P2 t and a unit cell of dimensions of a=121.lA, b=162.lA, c=139.4A, 
/ [3=90.1°. 

The crystal of the present invention is preferably from a protein that has secondary 
structural elements that include .alpha.-helix and antiparallel .beta.-sheets as shown 
in and described for Figures 7, 10, 11 and/or 12. Most preferably, the crystal of the 
invention has a three-dimensional structure as defined by atomic coordinates shown 
in Table 1 . Those of skill in the art understand that a set of structure coordinates 
determined by X-ray crystallography is not without standard error. For the purpose of 
this invention, any set of structure coordinates for AChBP or AChBP mutants that 
have a root mean square deviation of protein backbone atoms (N, C.alpha., C and 0) 
of less than 0.75 Angstrom when superimposed - using backbone atoms - on the 
structure coordinates listed in Table 1 shall be considered identical. 
In a most preferred embodiment of the present invention, the crystal has a binding 
cavity as shown in Figures 6, 8, 9 and/or 13. 

In accordance with the findings of the present invention, it is proposed to use the 
water-soluble ligand-binding proteins of molluscs as the blueprint for the receptor 
binding site of the ligand-gated ion channel superfamily including nACh, 5-HT3, 
glycine, GABA A , and GABA C , most preferably for the nAChR. The availability of X-ray 
structures, and the cloned sequences provide a unique opportunity to understand 
these receptors at the molecular level, possibly unravel the dynamic changes 
occurring upon ligand binding, and predict their tertiary and quaternary structure with 
a higher degree of confidence than possible for other protein modules. This should 
pave the way for designing ligands selective for any of the multiple subtypes in any of 
these receptor families. The AChBP-like structures can be used for' computerized 
docking to homology models which leads to the a priori discovery of novel ligands 
before laboratory experiments begin to optimize the drug candidates. 
Thus, the present invention also relates to a method of using the crystal of the 
invention in a drug screening assays, such as comprising: 
(a) selecting a potential ligand by performing structure assisted drug 

design with the three-dimensional structure determined for the crystal, 
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wherein said selecting is performed in conjunction with computer 

modeling; optionally 
(b) contacting the potential ligand with the ligand binding domain of 

the ligand-gated receptor in an in vitro or in vivo assay; and 
'(c) detecting the binding of the potential ligand for the ligand binding 

domain. 

The use of macromolecular crystallography as a tool for investigating drug and 
receptor interactions, in particular structure-based drug design is reviewed in Oakley 
and Wilce, Clin. Exp. Pharmacol. Physiol. 27 (2000), 145-151. The desired drug 
could be an inhibitor or an agonist that mimics endogenous transmitters or ligands. 
Once the 3-D structure of the relevant target is known, computational processes can 
be used to search databases of compounds to identify ones that may interact 
strongly with the target. Lead compounds can be improved using the 3-D structure of 
the complex of the lead compound and its biological target. The activity of the 
selected compound can then be tested in a functional assay such as one of those 
described herein.. 

Preferably, the potential drug is selected on the basis of its having a greater affinity 
for the ligand binding domain of the ligand-gated receptor than that of a standard 
ligand for the ligand binding domain of the ligand-gated receptor. However, the 
affinity of the selected compound may also be less than that of a standard ligand. 
Such compounds are useful for example as a lead for the development of further 
analogues which in turn may have enhanced binding affinity or otherwise beneficial 
therapeutic properties. On the other hand, the selected compound may bind to a site 
of the ligand-gated receptor other than known ligands. In a preferred embodiment, 
the ligand-gated receptor is a nicotinic acetylcholine receptor. 

In a further embodiment, the method of the present invention further comprises: 

(d) forming a supplemental crystal of a protein-ligand complex by co- 
crystallization or soaking the crystal of the water-soluble ligand-binding 
protein with a potential drug, wherein the crystal effectively diffracts X-rays for 
the determination of the atomic coordinates of the protein-ligand complex to a 
resolution of greater than 5.0, preferably greater than 4.0 Angstroms, more 
preferably greater than 3; 

( e ) determining the three-dimensional structure of the supplemental crystal; 
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(f) selecting a candidate drug by performing a structure assisted drug 
design with the three-dimensional structure determined for the 
supplemental crystal, wherein said selecting is performed in 
conjunction with computer modeling; optionally 
5 (g) contacting the candidate drug with a cell that expresses the ligand- 
gated receptor; and 

(h) detecting a cell response; wherein a candidate drug is identified as a drug 
when the cell response is altered compared to a cell that has not been 
contacted with the candidate compound. 

1 0 The above described methods can further comprise an initial step that precedes step 
(a) wherein said initial step consists of determining the three-dimensional structure of 
a crystal comprising a protein-ligand complex formed between the water-soluble 
ligand-binding protein, and the ligand of the ligand-gated receptor, wherein the 
crystal effectively diffracts X-rays for the determination of the atomic coordinates of 

15 the protein-ligand complex to a resolution of greater than 5.0, preferably greater than 
4.0 Angstroms. Preferably, the resolution of crystal diffraction in the above described 
methods is at least 3.0, most preferably at least about 2.7 Angstroms. 



In a still further embodiment, the present invention relates to a method of growing a 
20 crystal of a protein-ligand complex comprising: 

(a) contacting the water-soluble ligand-binding protein described above with a 
ligand of a ligand-gated receptor, wherein the water-soluble ligand-binding 
protein forms a protein-ligand complex with the ligand; and 

(b) growing the crystal of the protein-ligand complex; wherein the 
25 crystal effectively diffracts X-rays for the determination of the 

atomic coordinates of the protein-ligand complex to a resolution 

of greater than 5.0, preferably greater than 4.0 Angstroms, more preferably at 

least 3.0, most preferably at least about 2.7 Angstroms. 



30 The crystals of the present invention can also be used in X-ray crystallography- 
driven screening technique that combines the steps of lead identification, structural 
assessment, and optimization such as described for example in Nienaber et al., 
Nature Biotechnol. 18 (2000), 1105 - 1108. This crystallographic screening method 
(named CrystaLEAD) has been used to sample. large compound libraries and 

35 detecting ligands by monitoring changes in the electron density map of the crystal 
relative to the unbound form. The electron density map yields a high- resolution 
picture of the ligand-protein complex that provides key information to a structure- 
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directed drug discovery process. The bound ligand is directly visualized In the 
electron density map. Ligands that bind away from the targeted site may be 
eliminated. 

The above described methods can be coupled with state-of-the-art laboratory data 
5 collection facilities including CCD detectors and data acquisition robotics. 

Further embodiments that may be used in accordance with the ligand-binding 
proteins and receptor of the present invention are described in the prior art, for 
example ligand screening and design by X-ray crystallography is disclosed in 
W099/45379 and W099/45389; WO00/14105 describes assaying a candidate 
10 compound for its ability to interact with a modified receptor tyrosine kinase including 
obtaining and applying crystallography coordinates to a computer algorithm for 
generating a model which is applied in an iterative process to various molecular 
structures in order to identify agonist and antagonists of the receptor. All these 
methods may be equally applied to the proteins and crystals of the present invention. 

15 

In one preferred embodiment, the present invention relates to a drug screening assay 
comprising soaking a crystal of the invention in a solution of compounds to be 
screened and detecting the binding of the compound to the ligand-binding protein. A 
possible procedure is also described in Example 9. Besides the detection methods of 
20 ligand-binding mentioned above, in the cited documents and in the examples, the 
detection can also be based on measuring the release of the ligand in the preformed 
crystal of a protein-ligand complex. As described herein before, said ligand preferably 
comprises an alkylated nitrogen and/or quaternary ammonium ion or may be one of 
those described above, 

25 

The structural information on the crystals of the present invention can also be used 
for increasing or decreasing the affinity of a drug to a ligand-gated receptor. Such a 
method can comprise performing structure assisted drug design with the three- 
dimensional structure determined for the crystal, wherein said drug design is 

30 performed in conjunction with computer modeling; and modifying said drug to alter or 
eliminate a portion thereof suspected of interacting with a binding site of the binding 
cavity or with a non-specific binding site of the protein in the crystal. This method 
can, of course, be combined with one or. more steps of any of the above described 
screening methods or other screening methods well known in the art. Methods for 

35 clinical compound discovery comprises for example ultrahigh-tHroughput screening 
(Sundberg, Curr. Opin. Biotechnol. 11 (2000), 47-53) for lead identification, and 
structure-based drug design (Verlinde and Hoi, Structure 2 (1994), 577-587) and 
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combinatorial chemistry (Salemme et al., Structure 15 (1997), 319-324) for lead 
optimization. Further information that could be taken into account for drug selection 
and design so far available for the localization of agonist and competitive antagonist 
binding sites on nicotinic acetylcholine receptors have recently been reviewed (Arias, 
5 Neurochem. Int. 36 (2000), 595-6450; Corringer et al., 1999). Once a drug has been 
selected, the method can have the additional step of repeating the method used to 
perform rational drug design using the modified drug and to assess whether said 
modified drug displays better affinity according to for example interaction/energy 
analysis. 

10 

A related method of the present invention for drug design comprises the step of using 
the structural coordinates of the water-soluble ligand-binding protein crystal 
comprising the coordinates of Table 1 , to computationally evaluate a chemical entity 
for associating with the ligand-binding site or a non-specific binding site of a ligand- 

15 binding protein. This approach, made possible and enabled by this invention, is to 
screen computationally small molecule data bases for chemical entities or 
compounds that can bind in whole, or in part, to the AChBP. In this screening, the 
quality of fit of such entities or compounds to the binding site may be judged either by 
shape complementarity or by estimated interaction energy. Meng, et al., J. Coma. 

20 Chem. 13 (1992), 505-524. In addition, in accordance with this invention, AChBP 
mutants or chimerics may be crystallized in co-complex with known ligand-gated ion 
channel inhibitors. The crystal structures of a series of such complexes may then be 
solved by molecular replacement (for review see for example Brunger er al. Prog. 
Biophys. Mol. Biol. 72 (1999), 135-155; and references cited therein) and compared 

25 with that of wild-type AChBP. Potential sites for modification within the various 
binding sites of the ligand-binding domain may thus be identified. This information 
provides an additional tool for determining the most efficient binding interactions, for 
example, increased hydrophobic interactions, between AChPB and a chemical entity 
or compound. 

30 The design of compounds that bind to or inhibit ligand-gated ion channels according 
to this invention generally involves consideration of two factors. 
First, the compound must be capable of physically and structurally associating with 
the ligand-binding domain. Non-covalent molecular interactions important in the 
association of the ligand-binding domain with its ligand include hydrogen bonding, 

35 van der Waals and hydrophobic interactions. 

Second, the compound must be able to assume a conformation that allows it to 
associate with the ligand-binding domain. Although certain portions of the compound 
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will not directly participate in this association, those portions may still influence the 
overall conformation of the molecule. This, in turn, may have a significant impact on 
potency. Such conformational requirements include the overall three-dimensional 
structure and orientation of the chemical entity or compound in relation to all or a 
portion of the binding site or the spacing between functional groups of a compound 
comprising several chemical entities that directly interact with the AChBP. 
If the theoretical structure of the given compound suggests insufficient interaction 
and association between it and AChBP, synthesis and testing of the compound is 
obviated. However, if computer modelling indicates a strong interaction, the molecule 
may then be synthesized and tested for its ability to bind to AChPB or a ligand-gated 
ion channel and functionally tested according to the methods mentioned above. In 
this manner, synthesis of inoperative compounds may be avoided. Once suitable 
chemical entities or fragments have been selected, they can be assembled into a 
single compound or inhibitor. Assembly jnay be proceed by visual inspection of the 
relationship of the fragments to each other on the three-dimensional image displayed 
on a computer screen in relation to the structure coordinates of AChBP. This would 
be followed by manual model building using software such as Quanta or Sybyl. 
Useful programs to aid one of skill in the art in connecting the individual chemical 
entities or fragments include CAVEAT (Bartlett, et al, "CAVEAT: A Program to 
Facilitate the Structure-Derived Design of Biologically Active Molecules". In Molecular 
Recognition in Chemical and Biological Problems", Special Pub., Royal Chem, Soc. 
78 (1989), 182-196); 3D Database systems such as MACCS-3D (Martin, J. Med. 
Chem. 35 (1992), 2145-2154) and HOOK (Molecular Simulations, Burlington, Mass.). 
Instead of proceeding to build an AChBP ligand in a step-wise fashion one fragment 
or chemical entity at a time as described above, AChBP binding compounds may be 
designed as a whole or "de novo" using either an empty active site or optionally 
including some portion(s) of a known ligand(s). These methods include LUDl (Bohm, 
J. ComR. Aid. Molec. Design 6 (1992), 61-78); LEGEND (Nishibata and Itai, 
Tetrahedron 47 (1991), 8985); and LeapFrog (Tripos Associates, St. Louis, Mo.). 
Other molecular modelling techniques may also be employed in accordance with this 
invention; see, e.g., Cohen, J. Med. Chem. 33 (1990), 883-894 and Navia and 
Murcko, Current Opinions in Structural Biology 2 (1992), 202-210. 
Such computer modeling is preferably performed with a Docking program (Dunbrack 
et al., Protein Sci. 6 (1997), 1661-1681 and Folding Des. 2 (1997), R27-R42). 

Methods for the identification of drugs or corresponding lead compounds in 
computational prescreen using X-ray crystal structures are described in the prior art 
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(Verlinde and Hoi, Structure 2 (1994), 577-587; Kuntz, Science 257 (1992), 1078- 
1082; Shuker et al., Science 274 (1996), 1531-1534; Fejzo et al., Chem. Biol. 6 
(1999), 755-769; WO 98/58961). The structural information can be consulted to 
efficiently optimize leads. Computational programs have been written to identify 
5 compounds ranging from very small molecules or functional groups (GRID: 
Goodford, J. Med. Chem. 28 (1985), 849-857; MCSS: Cafllsh et al., J. Med. Chem. 
36 (1993), 2142-2167) to potential lead scaffolds (DOCK: Kuntz et al., Accounts 
Chem. Res. 27 (1994), 117-123) using solved X-ray crystal structures. Another 
method computationally prescreens compound libraries and experimentally tests the 
10 individual "hits" by X-ray crystallography (Verlinde et al., J. Comput. Aided Mol. Des. 
6 (1992), 131-147) in order to decrease the size of the screening library. In addition, 
an experimental approach has been developed to find organic solvents that bind to 
active sites that may be recombined into a lead macromolecule (Allen et al., J. Phys. 
Chem. 100 (1996), 2605-2611). 

15 

Once a compound has been designed or selected by the above methods, the 
efficiency with which that compound may bind to the AChBP or a corresponding 
ligand-binding domain may be tested and optimized by computational evaluation. For 
example, a compound that has been designed or selected to function as an inhibitor 

20 must preferably demonstrate a relatively small difference in energy between its 
bound and free states (i.e., a small deformation energy of binding). Thus, the most 
efficient inhibitors should preferably be designed with a deformation energy of 
binding of not greater than about 10 kcal/mole, preferably, not greater than 7 
kcal/mole. Inhibitors may interact with the ligand-binding domain in more than one 

25 conformation that is' similar in overall binding energy. In those cases, the deformation 
energy of binding is taken to be the difference between the energy of the free 
compound and the average energy of the conformations observed when the inhibitor 
binds to the AChBP. 

30 A compound designed or selected as binding to AChBP may be further 
computationally optimized so that in its bound state it would preferably lack repulsive 
electrostatic interaction with the target ligand-binding domain. Such non- 
complementary (e.g., electrostatic) interactions include repulsive charge-charge, 
dipoie-dipole and charge-dipole interactions. Specifically, the sum of all electrostatic 

35 interactions between the ligand and the AChBP when the ligand is bound to AChBP, 
preferably make a neutral or favorable contribution to the enthalpy of binding. 
Specific computer software is available in the art to evaluate compound deformation 
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energy and electrostatic interaction. Examples of programs designed for such uses 
include Gaussian 92, revision C (Frisch, Gaussian, Inc., Pittsburgh, Pa.); AMBER, 
version 4.0 (Kollman, University of California at San Francisco); QUANTA/CHARMM 
(Molecular Simulations, Inc., Burlington, Mass.); and Insight II/Discover (Biosysm 
5 Technologies Inc., San Diego, Calif.). These programs may be implemented, for 
instance, using a Silicon Graphics workstation, IRIS 4D/35, IBM RISC/6000 
workstation model 550 or better a Unix workstation (SGI, Alpha, Sun, etc.) or any 
Linux PC. Other hardware systems and software packages will be known to those 
skilled in the art. 

10 Once an AChBP-bjnding compound has been optimally selected or designed, as 
described above, substitutions may then be made in some of its atoms or side 
groups in order to improve or modify its binding properties. Generally, initial 
substitutions are conservative, i.e., the replacement group will have approximately 
the same size, shape, hydrophobicity and charge as the original group. It should, of 

1 5 course, be understood that components known in the art to alter conformation should 
be avoided. Such substituted chemical compounds may then be analyzed for 
efficiency of fit to AChBP by the same computer methods described in detail, above. 
As mentioned before, the above described methods of the present invention can also 
be used as an initial drug screening assay followed by a classical drug screening 

20 assay using the biochemical assays known in the art. 

Methods for the preparation of compounds, chemical derivatives and analogues are 
well known to those skilled in the art and are described in, for example, Beilstein, 
Handbook of Organic Chemistry, Springer edition New York Inc., 175 Fifth Avenue, 
New York, N.Y. 

25 

. In one embodiment of the method of the present invention the identified drug 
prevents or promotes correct assembly of a ligand-gated ion channel. Thus, the 
selected drug may for example either interfere with the contact regions of the 
monomers of the ligand-gated ion channel or may act as a scaffold for the assembly. 
30 In the latter case, the drug may be based for example on an antibody which binds to 
the contact regions of two or more monomers when assembled and thus facilitates 
the assembly process, Preferred contact regions with respect to the AChBP and the 
related nicotinic acetylcholine receptor are given below. In a still further embodiment 
of the above described methods, the drug can be selected such as to bind to a non- 
35 specific binding site of a ligand-gated ion channel. The non-specific binding site can 
for example include those contact regions that are highly conserved between the 
monomers of the ligand-gated ion channels. 
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Once a drug has been selected in accordance with any one of the above described 
methods of the present invention, the drug or a pro-drug thereof can be synthesized 
in a therapeutically effective amount. As used herein, the term "therapeutically 

5 effective amount" means the total amount of the drug or pro-drug that is sufficient to 
show a meaningful patient benefit, i.e., treatment, healing, prevention or amelioration 
of a condition related to an ligand-gated ion channel, or an increase in rate of 
treatment, healing, prevention or amelioration of such conditions. In addition or 
alternatively, in particular with respect to pre-clinical testing of the drug the term 

10 "therapeutically effective amount" includes the total amount of the drug or pro-drug 
that is sufficient to elicit a physiological response upon its binding to its target ligand- 
gated ion channel in an non-human animal test. 

The present invention also relates to a drug produced by any one of the above 
15 described methods of the present invention, or a pro-drug thereof. Preferably, the 
drug or pro-drug thereof is present either alone or in a composition in a 
therapeutically effective amount. 

The drug obtained by a method of the present invention may be characterized by its 
interaction with the binding sites in the binding cavity defined by the coordinates of 

20 crystal structure of the protein-ligand complex; for examples of such characterization 
see, e.g., US-A-5,798,247. Preferably, the drug, for example a potential inhibitor will 
form non-covalent bonds with one or more amino acids in the active site based upon 
the crystal structure. On the other hand, the drug may bind to a contact region of the 
individual monomers of the pentameric ligand-gated receptor. For example, mukimer 

25 contact regions in Lymnaea stagnalis AChBP (SEQ ID No. 2) have been identified. 
Consecutive regions have at least every second residue involved in contacts with the 
other monomer. Contacts have been defined as 2 atoms within 4.2 angstrom 
distance in 2.7 Angstrom structure. The primary contact regions in mature AChBP 
(residues from A contacting B) are 15-21, 44-47, 85-87, 91-94, 122-124,143-146, 

30 149, 185-187 and the complementary contact regions (from B contacting A, (identical 
to residues on A contacting E) are 3-4, 7-8, 11, 37-39, 53, 75-77, 96-104, 114-118, 
1 63-1 70; see also Figure 1 4. 

Thus, in one preferred embodiment the drug of the present invention interacts with a 
ligand-gated receptor comprising a pentamer with monomers A to E, wherein the 
35 drug binds to one or more primary contact regions of a monomer (residues from A 
contacting B) defined by amino acid residues 15 to 21 , 44 to 47, 85 to 87, 91 to 94, 
1 22 to 1 24, 1 43 to 1 46, 1 49, 1 85 to 1 87 of SEQ ID No. 2 and/or to one or more of the 
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complementary contact regions of the other monomer (from B contacting A, (identical 
to residues on A contacting E) defined by amino acid residues 3 to 4, 7 to 8, 1 1 , 37 to 
39, 53, 75 to 77, 96 to 104, 1 14 to 1 18 and 163-170 of SEQ ID No. 2; or to one of the 
contact regions identified in Figure 14; or to the corresponding contact regions of the 
5 monomers of a ligand-gated ion channel. Preferably, the ligand-gated ion channel is 
the nicotinic acetylcholine receptor and the order of the monomers is ayaSp. 

Any available method may be used to construct such mode/ from the crystallographic 
and/or amino acid sequence data disclosed herein or obtained from independent 

10 analysis of crystalline AChBP proteins or other water-soluble ligand-binding proteins 
of the present invention. Such a model can be constructed from available analytical 
data points using known software packages such as HKL, MOSFILM, XDS, CCP4, 
SHARP, PHASES, HEAVY, XPLOR, TNT, NMRCOMPASS, NMRPIPE, DIANA, 
NMRDRAW, FELIX, VNMR, MADIGRAS, QUANTA, BUSTER, SOLVE, O, FRODO, 

15 RASMOL, CNS , REFMAC, ARP/WARP, XTALVIEW and CHAIN. The model 
constructed from these data can then be visualized using available systems, 
including, for example, Silicon Graphics, Evans and Sutherland, SUN, Hewlett 
Packard, Apple Macintosh, DEC, IBM, and Compaq. The present invention also 
provides for devices such as a computer system which comprises the model of the 

20 invention and hardware used for construction, processing and/or visualization of the 
model of the invention. Further embodiments provide a computer system comprising 
computer hardware and the model of the present invention. The study of the 
interaction of the candidate species with the model can be performed using available 
software platforms, including QUANTA, RASMOL, O, CHAIN, FRODO, INSIGHT, 

25 DOCK, MCSS/HOOK, CHARMM, LEAPFROG, CAVEAT (UC Berkley), CAVEAT 
(MSI), MODELLER, CATALYST, XTALVIEW and ISIS. Computer readable media 
such as floppy discs, CD ROMs, tapes, and any other storage or processing means 
comprising crystallographic and/or nucleotide/amino acid sequence data disclosed 
herein or obtained from Independent analysis of crystalline AChBP proteins or other 

30 water-soluble ligand-binding proteins of the present invention are subject of the 
present invention as well. Any one of the mentioned means and devices can 
advantageously be used for modeling an antagonist/inhibitor or agonist/activator of a 
ligand-gated receptor. 

Furthermore, the present invention relates, to the construction of theoretical three 
35 dimensional (3D) models of ligand-binding domains of ligand-gated ion channels by 
computer-assisted molecular modeling using the X-ray coordinates of the water- 
soluble ligand-binding proteins of the invention. These 3D models can correspond 
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either to the entire ligand-binding domain (-220 to 240 extracellular amino acids) or 
may be limited to the ligand-binding site. 

The concept of using 3D structures of the mollusc ligand-binding proteins for 
molecular modeling and tool for structure prediction of for example mammalian, in 
5 particular human ligand-gated ion channels gains support from the observation that 
the ligand-binding domain of vertebrate glutamate receptor channels and bacterial 
periplasmic substrate-binding proteins (PBPs) share similar 3D structures despite the 
very low sequence similarity between ionotropic glutamate receptor subunits and the 
PBPs that were used as templates (12%); for review see Paas et al. TiPS 21 (2000), 

1 0 87- 92 and ref ernces cited therein 

Thus, on the basis of a computer-assisted molecular modeling, optionally 
supplemented by for example functional studies of site-specific mutants, the crystal 
structure of the ligand-binding domain of ligand-gated ion channels and theoretical 
3D models of these domains can be predicted. In turn, these models can be used for 

15 structure assisted drug design. The predicted models may be further refined, for 
example by monitoring the effects of mutations of amino acid residues that are 
probably located in the ligand-binding site on (1) agonist-elicited channel activation 
and desensitization, (2) inhibition of channel activity by various competitive receptor 
antagonists; or (3) the binding of various ligands. Experimental setups for analyzing 

20 such effects are known to the person skilled in the art, see a(so the documents cited 
for functional assay systems of ligand-gated ion channels. 

Thus, the embodiments of the present invention enable various possibilities for 
identification and modeling new ligands of ligand-gated ion channels as well as 
modifying the ion channels themselves. Accordingly, the present invention relates to 
25 the use of the above described polynucleotides, proteins, dimers and pentamers, 
ligand-gated ion channels, vectors, host cells, antigens, antibodies, oligonucleotide 
probes, crystals, their structural coordinates and methods for screening or profiling 
putative ligands of ligand-gated receptors. 

Methods for the lead generation in drug discovery using proteins and detection 
30 methods such as mass spectrometry (Cheng et al. J. Am. Chem. Soc. 117 (1995), 
8859-8860) and some nuclear magnetic resonance (NMR) methods (Fejzo et al., 
Chem. Biol. 6 (1999), 755-769; Lin etal., J. Org. Chem. 62 (1997), 8930-8931). 

The newly identified drug obtained by a method of the present invention, i.e. an 
35 antagonist/inhibitor or agonist/activator can be used for the preparation of a 
pharmaceutical composition for the treatment of a ligand-gated ion channel mediated 
or related disorder. Such disorders are well know to the person skilled In the art. For 
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example, possible applications of agonist and antagonists to nAChRs are based .on 
their participation in complex functions such as attention, memory, and cognition, and 
their involvement in the pathogenesis of certain neuropsychiatric disorders 
(Alzheimer's and Parkinson's diseases, Tourette's syndrome, schizophrenia, 
5 depression, etc). For the majority of these disorders, the use of nAChRs' agonists 
may represent either a prophylactic (esp. for Alzheimer's and Parkinson's diseases) 
or a symptomatic treatment; for review see for example Mihailescu and Drucker- 
Colin, Arch. Med. Res. 31 (2000), 131-144. 

The medicinal chemistry and molecular biology of GABA-activated ligand-gated ion 
10 channels also in terms of agonist and antagonist structural profiles is described in 
Chebib et al., J. Med. Chem. 43 (2000), 1427-1447. 

Glycine receptors and disorders of glycinergic neurotransmission are extensively 
reviewed in Rajendra et al., Pharmacol. Ther. 73 (1997), 121-146 and Barry et al., 
Clin. Exp. Pharmacol. Physiol. 26 (1999), 935-936. 
1 5 The central role of 5-HT3 receptor in CNS disorders and 5-HT3 receptor antagonists 
are described in Bloom and Morales, Neurochemical Research 23 (1998), 653-659 
and Higgins and Kilpatrick, Expert Opin. Invest. Drugs 8 (1999), 2183-2188. 



In one embodiment, the antagonist/inhibitor is or is derived from a protein, an 
20 antigen, antibody or from a toxin of the ligand-gated ion channel. Likewise, the 
agonist/activator can be derived from a protein, an antigen, antibody or from a toxin 
of the ligand-gated ion channel. Possible starting points comprise for example 
peptide toxins, e.g., conotoxin (IMl) and alpha bungarotoxin, lophotoxins 
(Bippinatins), tubocurarine, decamethonium, alpha-cobratoxin, epibatidine, 
25 acetylcholine, choline, nicotine, carbachol, serotonin or GABA. The structure of these 
molecules together with that of the crystal of the target Iigand-binding domain can be 
used to model the compound and elucidate side chains, functional groups etc. which 
may be added, deleted or modified in order to improve for example affinity and/or 
specificty of the drug or for example make a drug which acts on a different target 
30 non-reactive with a certain ligand-gated ion channel. 

In a preferred embodiment for the uses according to the present invention, the 
ligand-gated ion channel is the nicotinic acetylcholine receptor and said mediated or 
related disorder is Tourette's syndrome, Alzheimer's disease, addiction to nicotine or 
schizophrenia. 

35 

As mentioned herein before, this is the first time it could be shown that water-soluble 
Iigand-binding proteins exists in molluscs, which closely resemble the Iigand-binding 
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domain of ligand-gated ion channel of higher mammals. It is expected that similar 
ligand-binding proteins exist in other molluscan species or even in the lineage the 
Mollusca, Protostomia, Coelomata, Bilateria, Eumetazoa, Metazoa, Fungi/Metazoa 
group. Accordingly, the present invention also relates to the use of a ligand of a 

5 ligand-gated ion channel for identifying and isolating a water-soluble ligand-binding 
protein from such species, preferably from a mollusc. Preferably, the ligand used for 
the isolation of the protein is a-bungarotoxin. The water-soluble ligand binding 
proteins obtainable from these organisms as well as derivatives that can be made in 
accordance with the teaching present herein are also subject of the present 

1 0 invention. 

Furthermore, for the first time the crystal structure of a nicotinic binding site has been 
revealed. This crystal structure shows that the molluscan AChBP is a homolog of the 
LGIC superfamily ligand binding domains. It reveals the Ig-topology, the location of 

15 the binding site at the subunit interface, the position of the MIR and the extensive 
data on the nicotinic ligand binding residues. Importantly, it gives important new 
information about the exact fold and the arrangement of the nicotinic ligand-binding 
site in three dimensions. It shows the presence of a second pocket that has been 
noticed by EM analysis. Furthermore, it clarifies the arrangement of subunits by 

20 showing the relative positioning of the principal and complementary part of the 
ligand-binding site. It provides an explanation of the role of the LGIC superfamily 
conserved residues in stabilizing the monomer structure by the formation of 
hydrophobic cores and packing of secondary structure elements and it makes clear 
how the pentamers are built up, and how weakly the pentamer interfaces are 

25 conserved between LGICs. 

This structure can be used for the numerous drug-design studies that are targeting 
the LGIC superfamily. The general structural knowledge on its folding will be 
applicable to the GABA, serotonin (5HT 3 ) and glycine receptor fields. It will help to 
understand their ligand-binding characteristics and could thus have impact on 

30 development of e.g. ariti-emetics aimed at the 5HT 3 receptor or the mood-defining 
drugs that target the GABA receptors. However, the availability of a three- 
dimensional description of the nicotinic ligand-binding site will be especially relevant 
for the design of new drugs against Alzheimers' disease, epilepsy and the addiction 
to smoking which have the neuronal nicotinic receptors as their targets. 

35 

Many embodiments and the examples feature the acetylcholine-binding protein 
(AChBP) of the invention and the embodiments generally described herein are 
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preferably related to the nicotinic acetylcholine receptor (nAChR), more preferably to 
the alpha subunit, and most preferably to the alpha 7 subunit. However, it should be 
understood that all embodiments equally apply to the other water-soluble llgand- 
binding proteins and generally to the ligand-gated ion channels mentioned herein. 
For example, the crystal structure of the AChBP can be used to model new ligands 
for the acetylcholine receptor, preferably such with inhibiting or stimulating action on 
the acetylcholine receptor. Likewise, it is possible to identify and model new ligands 
for other ligand-gated ion channels (including glycine, GABA and serotonin receptor) 
with inhibiting action. Such ligands may for example prevent correct assembly of 
ligand gated ion channels, Preferably such ligands prevent correct assembly of 
specific sub types of ligand gated ion channels. On the other hand, ligands can be 
identified and modeled that promote correct assembly of ligand gated ion channels, 
preferably of specific sub types of ligand gated ion channels. As mentioned before, 
the methods of the present invention also allow modeling inhibitors for the non- 
specific binding site of ligand gated ion channels. 

In addition, it is possible to predict and create mutants and chimeras of AChBP with 
modified assembly behaviour, modified ligand binding behavior such as with 
increased resemblance of the binding site to the acetylcholine receptor subtype on 
the primary binding site and generally with increased resemblance to particular 
ligand-gated ion channels in activity and conformational changes. In view of the 
closest relationship between AChBP and the acetylcholine receptor it is particular 
preferred to create mutants and chimeras with increased resemblance of the binding 
site to the acetylcholine receptor subtype on the secondary binding site. However, 
the prediction and creation of mutants and chimeras with increased resemblance of 
the binding site to other ligand gated ion channels subtype on the primary binding 
site or on the secondary binding site are envisaged as well. 

These and other embodiments are disclosed and encompassed by the description 
and Examples of the present invention. Further literature concerning any one of the 
antibodies, methods, uses and compounds to be employed in accordance with the' 
present invention may be retrieved from public libraries and databases, using for 
example electronic devices. For example the public database "Medline" may be 
utilized which is available on the Internet, for example under 
http://www.ncbi.nlm.nih.gov/PubWled/medline.html. Further databases and 
addresses, such as http://www.ncbi.nlm.nih.gov/, http://www.infobiogen.fr/, 
http://www.fmi.ch/biology/researchJools.html, http://www,tigr.org/, are known to the 
person skilled in the art and can also be obtained using, e.g., http://www.lycos.com. 
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An overview of patent information in biotechnology and a survey of relevant sources 
of patent information useful for retrospective searching and for current awareness is 
given in Berks, TIBTECH 12 (1994), 352-364. 

5 This disclosure may best be understood in conjunction with the accompanying 
drawings, incorporated herein by references. Furthermore, a better understanding of 
the present invention and of its many advantages will be had from the following 
examples, given by way of illustration and which are not intended as limiting, 

10 Unless stated otherwise in the examples, all recombinant DNA techniques are 
performed according to protocols as described in Sambrook et al. (1989), Molecular 
Cloning : A Laboratory Manual. Cold Spring Harbor Laboratory Press, NY or in 
Volumes 1 and 2 of Ausubel et al. (1994), Current Protocols in Molecular Biology, 
Current Protocols. Standard materials and methods for plant molecular work are 

15 described in Plant Molecular Biology Labfase (1993) by R.D.D. Croy, jointly 
published by BIOS Scientific Publications Ltd (UK) and Blackwell Scientific 
Publications (UK). 

20 Brief description of the drawings 



Figure 1: Clustal X (1.8) multiple sequence alignment of AChBP amino acid 
sequences. The AChBP alignment was made using "ClustalX„1.8" 
(Thompson et al., Nucleic Acids Research 24 (1997), 4876-4882. The 

25 subsequent alignment was further processed using "Genedoc" version 

2.5.000 (Nicholas et al. (1997) Genedoc a tool for editing and 
annotating multiple sequence alignments). Identical amino acids are 
indicated with "*", equivalent amino acid with ":", and similar amino 
acids with Glycosylation sites are Asn 66 for L- AChBP and Asn 21 

30 and 26 for B-AChBP in the amino acid sequence of the respective 

mature AChBP SEQ ID No. 2 and 4, and 6 and 8, respectively. 

Figure 2: Hydrophobicity plots of the mature AChBP amino acid sequences. The 
B&L-AChBP hydrophobicity plots were made using "Protein sequence 
35 analyses" according to the method described in Kyte and Doolite (J. 

Mol. Biol. 157 (1982), 105-132). 2A: L-AChBP_T1 (SEQ ID No. 2), 2B: 
L-AChBP_T2 (SEQ ID No. 4), 2C: B-AChBP_T1 (SEQ ID No. 6), 2D: 
B-AChBP_T2 (SEQ ID No. 8). 
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Figure 3: Clustal X (1.8) multiple sequence alignment of AChBP amino acid 
sequences with the amino acid sequences of the ligand-binding 
domains of the ligand-gated receptors nAChR-a7, GABA A R-p1, 5- 
5 HT3R and GlyR-crt. Sequence alignment and processing was 

performed as described for Figure 1. The accession numbers of the 
amino acid sequences used for the alignment are as follows: Human 
alpha! : Human alpha7: Y08420; Human 5HT3: CAA06442; Human 
QlyRjalpha!: S12382; Human GABAaJ)1: NPJJ00797. A similar 
10 sequence alignment can be performed with the corresponding rat 

sequences (ratnAChRa7_Q0594! , rat5HT3R_P35563, 

ratG ABARb! _P1 5431 , ratGlyRa!_p24524) which will give 
substantially similar if not identical results. 

15 Figure 4: Clustal X (1.8) multiple sequence alignment of AChBP amino acid 
sequences with the amino acid sequences of nAChRs. Sequence 
alignment and processing was performed as described for Figure 1. 
The accession numbers of the amino acid sequences used for the 
alignment are as follows: Human alphal : ACHUA1 ; Human alpha2: 

20 AAG23253; Human alpha3: A53956; Human alpha4: P43681 ; Human 

alphaS: P30532; Human alpha6: Q15825; Human alpha7: Y08420; 
Human alpha9: CAB65091. A similar sequence alignment can be 
performed with the corresponding rat sequences 
(ratnAChRa7_Q05941 , rriAChRa9_P43! 44, rnAChR2_P! 238, 

25 rnAChRa3_P04757, rnAChRa4_P09483) which will give substantially 

similar if not identical results. 

Figures: Clustal X .(1.8) multiple sequence alignment of AChBP amino acid 
sequences with the amino acid sequences of nAChRs alpha 1 and 7. 
30 Sequence alignment and processing was performed as described for 

Figure !. The accession numbers of the amino acid sequences used 
for the alignment are as follows: Human alphal; ACHUA1; Human 
alpha7: Y08420. A similar sequence alignment can be performed with 
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the corresponding rat sequence ratnAChRa7_Q05941 which will give 
substantially similar if not identical results. 



5 Figure 6: The pentameric structure of AChBP. a In this schematic representation 
each monomer has a different grey level. Subunits are labeled anti- 
clockwise, with A-B, B-C, C-D, D-E and E-A forming the plus and 
minus interface side, with the principal and complementary ligand- 
binding sites respectively (ball-and-stick representation), b Viewing the 
10 AChBP pentamer perpendicular to the five-fold axis. The equatorially 

located ligand-binding site (ball-and-stick representation) is highlighted 
only in the A (light) and B (dark) interface. 

Figure 7: The AChBP monomer. Ribbon representation of the AChBP monomer. 

1 5 The secondary structure starting from the N-terminus (top) towards the 

C-terminus (bottom). The monomer is viewed towards the center of 
the pentamer. In the nAChR, the top would correspond to the N- 
terminus of the ligand binding domain, pointing towards the synaptic 
cleft, while the C-terminus would be entering the membrane at the 

20 bottom, continuing into the transmembrane domain. The AChBP 

• monomer is built up mainly of B-strands, except for an N-terminal (- 
helix. It contains 14 B-strands that are organized in the two antiparallel 
B-sheets, with an immunoglobulin topology. However, in contrast to 
the classical immunoglobulin fold, the AChBP B-sheets are rotated 

25 against each other, forming a small pocket, as visible in Figure 6. 

Figure 8: The ligand-binding site at dimer interface. Ribbon representation of 
two neighboring AChBP monomers. Monomer A is shown in grey and 
monomer B in dark grey. The ligand-binding site is located at the 

30 interface between two monomers. As predicted for the nAChRs, the 

acetylcholine binding site in AChBP occurs at the. interface between 
two neighboring subunits. Similar to the model proposed for the 
nAChRs, the ligand-binding site is asymmetric, formed mainly by 
aromatic residues. Residues from mature AChBP monomer: A 

35 (TyrA89, TrpA143, TyrA185, CysA187, CysA188 and TyrA192) form 

the principal component, while residue TrpB53 from monomer B 
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creates the complementary part of the ligand-binding site. There are 
five identical ligand-binding sites in the AChBP pentamer, similar to 
the homomeric a7 neuronal receptor. 

5 Figure 9: The ligand binding site. Stereo figure showing the ligand binding site in 
AChBP, at the interface of two monomers. Residues from mature 
AChBP monomer A (TyrA89, TrpA143, TyrA185, CysA187, CysA188 
and TyrA192) form the principal component, while residue TrpB53 
from monomer B creates the complementary part of the ligand-binding 

10 site with with additional residues ArgB104, LeuB112 and MetB114. 

There are five identical ligand-binding sites in the AChBP pentamer, 
similar to the homomeric a7 neuronal receptor. 

Figure 10: Multiple sequence alignment of AChBP amino acid sequences with 
1 5 indication of secondary structure and solvent accessibility derived from 

the crystal structure. Alignment of the four molluscan AChBP 
sequences, with secondary structure and solvent accessibility of the 
Lymnea stagnalis AChBP-1 indicated from the crystal structure. The 
Figure was prepared with ESPript (Gouet et al., Bioinformatics. 15 
20 (1999), 305-308), using DSSP (Kabsch and Sander, Biopolymers. 22 

(1983), 2577-2637). Under the alignment the solvent accessibility is 
indicated, white most buried, dark blue most exposed, according to 
ESPript defaults (blue A> 0.4, cyan 0.1 <A<0.4, white A<0.1). 

25 Figure 11: Sequence alignment of AChBP with LGICs. The alignment shows only 
the N-terminal domain of the LGIC subunits and is based on a multi- 
sequence alignment of 92 full-length LGIC sequences. Abbreviations 
used, H and Tea, stand for human and Torpedo californica. Secondary 
structure elements (a:a-helix, p:>p-strand, r\: 3 10 -helix) are indicated 

30 above the sequence, in accordance with Fig 12a. AChBP shares 23% 

sequence identity with the ligand-binding domain of human ct 7 . The 
LGIC conserved residues (bold, grey background) are displayed. 
Beginning and end of the Cys-loop are indicated by a "*". Nicotinic 
receptor ligand-binding residues on the principal and complementary 

35 side are indicated. 
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Figure 12: Overview of the AChBP monomer structure, a Stereo representation of 
the AChBP monomer as viewed from outside the pentameric ring. 
Disulfide bridges are indicated in ball-and-stick representation. In a 
complete ion-channel the N-terminus would be pointing towards the 
synaptic cleft, while the C-terminus would enter the membrane at the 
bottom, continuing into the first transmembrane domain, b Topology 
diagram of the AChBP monomer. For comparison with Ig-folds the 
strands have been labeled a-g, showing the additional strand (b') and 
hairpin (f-f). In this structure, strands have been labeled 01-010 with 
loops (or turns) L1-L10 preceding each strand with the same number. 
The P5 strand is broken (05-05') with internal loop L5\ P6 also has a 
small break, but is shown continuously; (see Fig. 11). The precise 
beginnings and ends of strands may change slightly with increasing 
resolution, but the topology seen here will be highly conserved across 
the entire family of LGICs. 



Figure 13: The ligand-binding site, a Stereo representation of the ligand-binding 
site in ball-and-stick representation, showing the contribution of the 
principal A (TyrA89/aiTyr93), B (TrpA143/ctiTrp149) and C 

20 (TyrA1 85/aJyrl 90, CysA1 87/a,Cys1 92, CysA1 88/^Cysl 93, TyrA1 92 

, /ajyrl 98) and the complementary D (TrpBSS/yTrpSS, GlnB55/yGlu57) 
, E(ArgB104/yLeu109, ValB106/yTyr111, LeuB112/yTyr117, MetB114 
/yl_eu119) and F (TyrB164) 'loops\ b Stereo view of the electron 
density map displaying a HEPES buffer molecule in the ligand-binding 

25 site. This experimental density (contoured at 1 a) is derived from 

cross-crystal averaging, c Location of the principal ligand-binding 
residues on the monomer, d Location of the complementary ligand- 
binding residues on the monomer, (orientation as in Figure 6b) 



30 Figure 14: 



35 



Dimer interface a Stereo figure of the dimer interface. Representation 
of the interface residues (ball-and-stick) on a schematic secondary 
structure figure. The figure shows the plus face of subunit A and the 
minus minus face of subunit B b Dimer interface interactions. Note that 
due to the low conservation of these interfaces (Fig. 11) the actual 
interactions will not be conserved in any LGIC interface, but that in all 
receptors the topological regions are likely to form the interface. 
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Figure 15: Conservation in the LGIC superfamily. Conserved residues are 
indicated on the top, middle and bottom respectively on the monomer 
as viewed from the central pore. The hydrophilic conserved residues 
are indicated in dark. Conserved residues are indicated as viewed 
from the central pore. Hydrophobic Cluster I: residues 6, 10, 63, 65, 
71, 81, 105, 111; Cluster II: residues 20, 27, 29, 31, 58, 82, 84, 86, 
140, 150, 152, 195;. Cluster III: residues 33, 35, 38, 41, 48, 52, 125, 
138, 171, 173, 199, 201. The hydrophilic conserved residues: Asp60, 
Asp85, Asn90, Gly109, Cys123, Cys136, Lys203. Conserved residues 
in the ligand binding site: 106, 145, 192. These three and Lys203 are 
the only conserved residues without structural role in the monomer. 
Note how very few conserved residues are at the surface. Within the 
LGIC family the Cys-loop residues are also highly conserved; see 
bottom, left. 
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EXAMPLES 



EXAMPLE 1: Isolation of Lymnaea AChBP from the CNS, determination of 
5 mass and N-terminal protein sequence 

Isolation : 80 CNS of Lymnaea were homogenized in lysis buffer (PBS [16 mM 
Na2HP04, 4 mM NaH2PC>4, pH 7.4; 150 mM NaCI] 0.5% Nonidet P-40; 0.1% triton, 
0.2% tween-20) containing 1 ug/ml aprotinin, 10 ug/ml benzamidine, 0.5 ug/ml 
leupeptin, 24 ug/ml pefabloc. The CNS lysate was cleared by triplicate centrifugation 

10 at 12,000x g for 5 min. Streptavidin-coated magnetic beads (Dynal, Oslo), 5 mg, 
were saturated with a-bungarotoxin conjugated to Biotin (4 ug) (Molecular Probes, 
Oxford, UK). These beads were washed in PBS to remove excess a-bungarotoxin, 
then added to the cleared CNS lysate, and incubated for 1 h. After this, beads were 
washed 3 times in PBS to remove unbound protein. A control reaction without a- 

1 5 bungarotoxin was performed. Proteins bound to a-bungarotoxin were allowed to elute 
off in 10 \x\ of PBS containing 10-4 M nicotine for 1 h. 

Mass determination: The eluent was separated on a microcolumn LC system was 
similar to that described previously (Hsieh et al.; Anal. Chem. 70; 1998; 1847-1852). 
A commercial syringe pump (Perkin-Elmer /ABI, model 140B) was used to deliver a 

20 flow rate of 20 \i\ /min to the column. After loading of sample to the column the flow 
rate was dropped to 10 ^il /min. The eluent was then switched from 0.2% acetic acid 
to 0.2% acetic acid/ 60% acetonitril in 1 min. Electrospray mass spectra in MS mode 
were acquired on a Micromass Q-TOF quadropole time-of-flight mass spectrometer 
equipped with a Z-spray atmospheric pressure ionization source. 

25 Protein sequence analysis : For sequence analysis a-bungarotoxin binding protein 
was extracted using the same procedure, now followed by SDS-PAGE and Western 
Blotting on PDVF membrane. Sequence analysis was performed with Edman 
degradation of the 24 kDa blotted protein (apparent MW) using a protein sequencer 
(ABI, Perkin Elmer), 

30 

EXAMPLE 2: Cloning the Lymnaea AChBP cDNA sequence: PCR and 
screening of a cDNA library 

PCR cloning : A degenerate oligonucleotide was synthesized based on the amino 
acid sequence LDRADILYNI (SEQ ID No. 10), residues 1-10, of AChBP, (5'- 
35 CGGATCCGA(TC)(AC)GIGC(GATC)GA(TC)AT(ATC)(TC)T(GATC)TA(TC)AA(TC)A 
T-3'; SEQ ID No. 11), containing a BamHI restriction site, and used in combination 
with a primer on the IZAPII lambda vector. PCR was performed on a IZAP II cDNA 
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library of the Lymnaea CNS, in a 100 |xl reaction volume with 1.0 unit of Super Taq 
DNA polymerase (Boehringer Mannheim, Germany) in a DNA thermal cycler (Perkin- 
Elmer Cetus, CT) using 45 cycles of (94 °C, 20 sec; 53 °C, 30 sec; and 72 °C, 1 min. 
Amplified cDNA was digested with BamH\ and EcoRI, separated on agarose gel, and 
a product of -900 bp was cloned and sequenced. 

Library screening : Approximately 20,000 clones of the amplified lambda ZAP II CNS 
cDNA library were plated at a density of 10 5 pfu/400 cm 2 and absorbed to charged 
Nylon membranes (Boehringer Mannheim, Germany). The AChBP PCR product was 
used as a random primed probe, labeled with [alfa 32 P]dATP (specific activity >10 9 
cpm/mg). Membranes were hybridized in 6x SSC (1x SSC: 0.15 M NaCI and 0.015 M 
Na-citrate), 0.2% SDS, 5x Denhardts and 10 ug/ml herring sperm DNA at 65 °C for 
18 h. The filters were washed in 0.2x SSC, 0.2% SDS, at 65 °C for 30 min, and 
autoradiographed. Four individual cDNA clones were in vivo excised, and sequenced 
using dideoxy chain termination in both orientations. Two types of sequence were 
obtained, named L-AChBP_T1 and L-AChBP_T2, The signal sequences were 
determined with "SMART", Simple Modular Architectur Research Tool (V3.1); see 
Schultz et al., Proc. natl. Acad. Sci. USA 95 (1998), 5857-5864 and Nucleic Acids 
Res. 28 (2000), 231-234. In case of L-AChBPJTI (SEQ ID No. 2) the prediction 
could experimentally be confirmed. 

EXAMPLE 3: Lymnaea AChBP-related sequences: cloning of the Bulinus 
truncatus cDNAs 

Total RNA was isolated from Bulinus brain ganglia (CNS), and reverse transcribed 
into hexanucleotide primed cDNA. Two degenerate oligonucleotides, directed to the 
Lymnaea AChBP JT1 sequence, forward primer: 5'- 
GCGAATTCGAYACIGARWSIGGNGCNACNTG-3' (SEQ ID No. 12), reverse primer: 
5'-GCGMGCTTCRTCYTCRTAIGCYTCNGCRCARC-3' (SEQ ID No. 13), were used 
to amplify AChBP-related sequences. PCR was performed on one animal equivalent 
of CNS cDNA using 150 pmole of each primer under standard conditions for 45 
cycles (94'C, 20 sec; 54' C, 30 sec; 72 *C, 1 min). Amplified cDNA was EcoRI/Hindlll 
digested, cloned into EcoRI/Hindlll digested pBluescript, and sequenced. The ORFs 
of the obtained sequences showed a Bulinus AChBP, sequence-related to Lymnaea 
AChBP, named B-AChBP_T1 . This partial cDNA was used to screen a Bulinus brain 
cDNA library using the same hybridization protocol as described for the cloning of the 
Lymnaea cDNAs, and yielded two cDNA clones, encoding B-AChBP_T1 and B- 
AChBPJT2. Sequencing of the cDNAs was performed in both orientations. 
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EXAMPLE 4: The production of L-AChBP-T1 and -T2 and B-AChBP-T1 and -T2 
in the yeast Pichia pastoris and functional characterization 

5 Production of recombinant AChBP: 

In order to produce L-AChBP_T1 and T2 and B-AChBP_T1 and T2 as recombinant 
proteins in the Pichia pastoris expression system (Pichia Expression Kit version 3.0, 
Invitrogen), the DNA sequence encoding the mature form of these proteins (see 
sequence files) was cloned into the pPIC9 expression vector (Invitrogen). The mature 

1 0 sequences of L-AChBP_T1 , T2 and B-AChBP_T1 and T2 were PCR amplified (using 
Pf u-taq DNA polymerase (Stratagene) in order to avoid introduction of errors into the 
sequence due to PCR) and restriction sites were added to the primers to allow rapid 
pPIC9 compatible cloning. The amplified sequence of mature AChBP L-AChBP_T1 
was EcoR! inserted into pPIC9, whereas L-AChBP„T2 and B-AChBP_T1 and T2 

1 5 were Xhol/EcoRI inserted into pPIC9 (the alpha-mating factor cleavage site was fully 
reconstructed after Xhol digestion). 

Constructs with and without an additional C-terminal His-tag (SRGHHHHHH (SEQ ID 
No. 14) in the case of L-AChBP_T1, EFKDDDDKHHHHHH (SEQ ID No. 15) 
otherwise) were generated for each of the AChBP (sub)types. The AGhBP/pPIC9 

20 constructs were amplified in E. coli DH5aF and isolated and purified using the 
plasmid Maxi Kit (Qiagen). Due to the engineered cleavage site at the N-terminus of 
the amino acid sequence four additional amino acids (EAEA, SEQ ID No. 16) will 
precede the N-terminus of the original mature protein. Prior to transfection into Pichia 
pastoris the constructs were linearised (for protocol see supplier's manual; Pichia 

25 Expression Kit version 3.0, Invitrogen) and subsequently purified by 
phenol/chloroform extraction, and ethanol precipitation. Approximately 5 ^g of each 
of the linearised constructs was transformed into freshly prepared electro-competent 
Pichia pastoris cells and plated onto MD plates (for protocol see supplier's manual; 
Pichia Expression Kit version 3.0, Invitrogen corporation). Eiectrocompetent Pichia 

30 pastoris cells were aquired according to the protocol provided by Invitrogen. Plates 
were incubated at 30°C until the appearance of Pichia colonies, which were 
subsequently analysed for the presence of the correct insert by PCR amplification 
(for protocol see supplier's manual; Pichia Expression Kit version 3.0, Invitrogen). 
Colonies containing an homologous recombination with the Pichia genome, carrying 

35 the AChBP sequence, were grown in 25 ml of BMGY for 1-2 days (30°C; rotation at 
250rpm), after which the cells were centrifuged (10 min., 1500g) and the cell pellet 
was resuspended into 10 ml of BMMY. Growth (30°C, 250rpm) was continued for an 
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additional 4 days (day 3-6), during which the expression of AChBP was induced by 
the addition of 100% methanol (1% of total culture volume) once every 24 hours. At 
day seven the culture was centrifuged (15 min.; 2000g; 4°C) and the medium was 
collected. The AChBP expression level of the various cultures was determined by the 
analyses of a fraction of the collected medium with SDS-polyacrylamide gel 
electrophoresis (see suppliers manual; Pichia Expression Kit version 3.0, Invitrogen). 
The cultures that yielded the highest level of AChBP expression were selected and 
stored as glycerol stocks. 

Recombinant AChBP that contained a C-terminal His-tag was isolated and purified 
from the Picha pastoris medium using Talon metal affinity resin (according to protocol 
as described within the user manual; Clontech laboratories Inc.). The protein 
concentration was. subsequently analysed using SDS-polyacrylamide gel 
electrophoresis and reference marker proteins. Polyclonal antibodies have been 
raised succesfully to the recombinant L-AChBP_T1 and B-AChBPJ"1 proteins in 
Balb-C mice. Immune-sera were obtained without crosslinking of the proteins. 

Binding characteristics of AChBP: 

First the binding curve of a-Bungarotoxin to His-tagged AChBP was determined, and 
an affinity of 3.5 nM was calculated. Using a-Bungarotoxin in a competitive binding 
assay ligands of several types of ligand-gated ion channels were then tested on His- 
tagged AChBP, i.e., ACh, serotonin, GABA, glycine, and glutamate. Both ACh and 
serotonin did compete with a-Bungarotoxin binding at 4.2 mM and 269 mM, IC50s 
respectively. GABA, Glycine and glutamate did not compete for binding with a- 
Bungarotoxin. Thus, as predicted by the primary sequence and by subunrt structure 
also the ligand-binding characteristics of AChBP resembled that of a nAChR. 
In a second series of competitive binding assays the ligand binding characteristics of 
AChBP were studied in more detail, now using various agonists and antagonists of 
the AChRs. Nicotine a classical agonist of the nAChRs, is a high affinity ligand of His- 
tagged AChBP (IC50 98 nM). Epibatidine, a high affinity agonist of the nAChRs, also 
binds with high affinity to His-tagged AChBP (IC50 1.4 nM), which is even higher 
than the 58 pM affinity of epibatidine reported for the nAChR (Badio, Mol. Pharmacol. 
45 (1994), 563-569). Other cholinergic agonists bind with a lower affinity e.g., 
decamethonium, carbachol, and choline respectively with IC50s of 4.1 jliM, 43 \iM. 
and 190 mM. Summary of affinities indicated in Table 2. 
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Table 2 





IC50 (\xM) 


nHill ; 




IC50(jxM) 


nHIII 


serotonin 


269 ± 67 


0.65 ± 0.03 


a-cobratoxin 


16.2 ±0.1 


4.08 ± 0.30 


choline 


190±32 


0.91 ± 0.20 


atropine 


5.25 ±0.49 


1.91 ±0.23 


carbachol 


43 ±2.7 


0.67 ±0.05 


decamethonium 


4.1 ± 0.3 


1.13 ±0.09 


acetylcholine 


4.2 ±1.1 


0.72 ±0.09 


physostigmlne 


1.25 ±0.04 


0.66 ±0.07 


nicotine 


0.098 ±0.025 


0.78 ±0.05 


d-tubocurarine 


0.093 ±0.003 


0.83 ± 0.04 


epibatidine 


0.0014 ±0.0001 


0.66 ± 0.04 


gallamlne 


0.039 ±0.007 


0.71 ±0.14 








ct-bungarotoxin 


0.0026 ± 0.0006 


0.80 ±0.1 8 



5 Competition-binding of typical antagonists of the nAChRs, e.g., tubocurarine and cc- 
Bungarotoxin, have a high affinity for His-tagged AChBP, respectively IC50s of 93 
nM and 2.6 nM. The cholinergic antagonist succinylcholine has a very low affinity for 
His-tagged AChBP (IC50 7.9 mM). Interestingly, also muscarinic receptor 
antagonists bind to His-tagged AChBP with relatively high affinity, e.g., the 

10 muscarinic allosteric modulator gallamine (IC50 39 nM), and the muscarinic 
antagonist atropine (IC50 5.3 mM). Physotigmine which is a known blocker of 
acetylcholinesterase and is also an antagonist of the nAChR, binds to His-tagged 
AChBP with an IC50 of 1 .3 mM. 

Finally, Bipinnatin-B was tested, a synthetic form of the coral lophotoxin on AChBP 
15 (Groebe and Abramson, J. Biol. Chem. 270 (1995), 281-286). Bipinnatin-B is a 
general blocker of nAChRs and is known to covalently bind to Tyr-190 of the a 
subunits (Abramson, J. Biol. Chem. 263 (1988), 18568-18573). His-tagged AChBP 
was incubated with the toxin, and the mass of the protein increased with 430.1 Da, 
corresponding well to the calculated mass of Bipinnatin-B of 431 Da, indicating that 
20 the toxin also binds to Tyr-1 84 in His-tagged AChBP. 

EXAMPLE 5: Expression and purification of recombinant AChBP for 
crystallization 

The AChBP JT1 protein from Lymnea stagnalis (AChBP) was overexpressed in 
25 Pichia pastoris GS115 strain using the AOX1 gene expression system from 
Invitrogen. Media and methods used for AChBP expression are also described in 
Invitrogen manual Pichia Expression Kit. For long term storage the transformants 
were grown overnight in YPD medium at30°C. 
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YPD or Yeast Extract Peptone Dextrose medium 

1% yeast extract (Difco) 

2% peptone (Difco) 

2% dextrose (glucose) (Merck) 

The cells were harvested and suspended in YPD medium containing 15% glycerol at 

final OD600 of -50. The cells were frozen in a dry ice/ethanol bath and stored in the 

freezer (Revco) at ~80°C. Normally, the expression of AChBP started with plating the 

cells from the glycerol stock on MD plate. 

MD or Minimal Dextrose Medium 

1 .34% YNB (yeast nitrogen base w/o amino-acids) (Difco) 

4x1 0-5 % d-biotin (Sigma) 

1% dextrose 

For plates add 15g of agar (Difco) 

The plate was stored in the incubator (Heraeus) for 3-4 days at 30°C. A single colony 

was picked from the plate and inoculated in 150 ml baffled flask (Nalgene) containing 

25 ml of BMGY medium. 

BMGY or Buffered Glycerol-complex Medium 

1 % yeast extract 

2% peptone 

100 mM potassium phosphate (pH 6.0) (Merck) 
1.34% YNB 
4x10-5% d-biotin 
1% glycerol (Merck) 

The culture was placed into the shaker (New Brunswick) and left to grow overnight 
rotating at 250 rpm at 30°C. The following day 12.5 ml of the culture was inoculated 
into 225 ml of BMGY medium in a 1000 ml baffled flask. In order to increase the yield 
of expressed AChBP a larger number of flasks were used, usually 16. The flasks 
were placed in the shaker and start-cultures were rotated at 250 rpm at 30°C. After 
two days the start-cultures were centrifuged for 15 min at 2500 rpm (Sorvall RC3B+, 
rotor H-6000A) at room temperature. In order to increase the cell mass for bigger 
protein production, cell pellets of two start-culture flasks were pooled together and 
resuspended in 200 ml of BMMY medium containing 1% (w/v) casamino acids. 
BMMY of Buffered Methanol-complex Medium -*- 1% casamino acids 
1% yeast extract 
2% peptone 

100 mM potassium phosphate (pH 6.0) 
1.34% YNB 
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4x10-5%d-biotin 
0.5% methanol (Merck) 
1% casamino acids (Difco) 

The cultures were put back into the shaker (250 rpm, 30°C) and induced for the 

5 following 4 days. The concentration of methanol in the medium was kept constant by 
adding 1% (v/v) methanol to the cultures every 24 hours. After 4 days 100 ml of 
culture was harvested and the original volume of 200 ml was readjusted by adding 
fresh BMMY medium with 1% casamino acids. The remaining cultures were induced 
for another 4 days. The harvested cultures were centrifuged for 15 min at 4000 rpm 

10 (Sorvall RC3B+, rotor H-6000A) and the cell pellet was discarded. The supernatant 
was first filtered through a 0.22 pm filter (Millipore) to remove any remaining cells and 
it was concentrated using a Minitan system (Waters/Millipore) with 30kDa cutoff filter 
(Waters/Millipore). Both the filtration and concentration and were performed at 4°C. 
Finally, centrifugation at 1 6000 rpm was done (Sorvall RC5C, rotor SS-34) in order to 

15 remove any debris left after the first two steps. The final volume of concentrated 
sample was -80 ml and it was dialyzed overnight against 2 x 5 I (20 mM Tris [pH 
8.0], 150 mM NaCI and 0.02% NaN 3 ) using 15kDa cutoff dialysis membrane 
(Spectra/Por) at 4°C. The dialyzed protein solution (-100 ml) was loaded onto an 
anion-exchange column (POROS 50 HQ, Pharmacia, column volume 8 ml). After the 

20 initial wash step of -15 column volumes using loading buffer, a salt gradient of 30 
column volumes was run from 150 mM to 1000 mM NaCI. Both solutions contained 
also 20 mM Tris (pH 8.0) and 0.02% NaN 3 . The peak of interest eluted at -300 mM 
NaCI (conductivity range 16-24 mS/cm). The presence of AChBP was checked by 
Bio-Rad Protein Assay (Bio-Rad) and SDS-PAGE and the fractions of interested 

25 were pooled and concentrated using a Centriprep with a 30kDa cutoff membrane 
(Amicon). The concentrated sample (volume of 5 ml) was loaded onto a gel filtration 
column (Superdex 200 HR 16/60, Pharmacia, column volume 120 ml) using 20mM 
Tris (pH 8.0), 150 mM NaCI and 0.02% NaN 3 . The protein eluted starting from 60 to 
71 ml with peak at -66 ml. The final purification step of the protein was done on an 

30 anion-exchange column (MonoQ HR10/10, Pharmacia, column volume 6 ml). The 
protein was loaded onto the column in the same buffer as eluted from the gel filtration 
column. The salt gradient used for the column was the identical to the one used for 
the POROS 50 HQ column. The fractions in the conductivity range 25-27.5 mS/cm 
were pooled together and dialyzed against buffer containing 50 mM HEPES (pH 7.0) 

35 and 0.02% NaN 3 . The protein was concentrated up to -20 mg/ml using a Centricon 
with a 30kDa cutoff membrane (Amicon). The total yield was about 2 mg purified 
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protein per liter of expressed medium. The concentrated protein was stored at 4°C 
and used for crystallization experiments and biochemical characterization. N-terminal 
sequencing revealed the presence of EAEAYVEF residues that are part of the plC9- 
encoded signal sequence, before residue 2. The experimental mass was determined 
to be 26544 Da (MALDI), which is ~2kDa more than calculated mass based on 
amino-acid sequence (24649 Da). The difference is assigned to glycosylation of 
AChBP at position Asn66 in the mature sequence, confirmed by deglycosylation 
experiments with N-glycosidase F (Boehringer). 

The purification of the first harvest was done separately from the full harvest. They 
were pooled together prior to the last purification step (anion-exchange 
chromatography step on MonoQ column). All above mentioned chromatography 
columns were mounted on an FPLC system (Pharmacia) controlled by the UNICORN 
system (Pharmacia). All solutions used in the FPLC system were prepared with 
MilliQ UF+ water, filtered through 0.22 pm filter (Millipore) and degassed. 

EXAMPLE 6: Crystallization of the AChBP 

All the crystallization experiments were done by vapor diffusion technique in a 
hanging drop mode using 12 well tray (Nelipak) and siliconized cover slides 
(Hampton Research). Trays were placed in a sandwich box (Semadeni) and stored 
at 19°C temperature conditioned room. The initial crystallization attempts were 
performed using Hampton Crystal Screen I and II (Hampton Research). Drops 
contained 2 pi of protein (10 mg/ml in 50 mM HEPES [pH 7.0] and 0.02% NaN 3 ) and 
2 pi of reservoir solution. From the first screen it became clear that AChBP makes 
crystalline precipitate in the presence of CaCI 2 salt. A more detailed screen was 
made which produced crystals suitable for X-ray analysis. The AChBP crystals 
appeared in the following conditions: 9-11% (w/v) PEG 4000 (Hampton Research), 
100 mM HEPES (pH 7.0), 50-200 mM CaCI 2 x 6H 2 0 and 0.02% NaN 3 or or PEG 
MME 550 10-18% in the same conditions, with 0.3 mM ZnAcetate as additive. 
Depending on the batch of the protein used and the CaCI 2 concentration three 
different crystal forms were found: orthorhombic, tetragonal and monoclinic. Both 
orthorhombic and monoclinic crystal forms are frequently twinned. Orthorhombic rod- 
like crystals appeared immediately upon setting up the crystallization experiments (in 
between first few hours) under high [CaCy. The size of the crystals varied from 
0.05x0.05x0.15 to 0.25x0.25x1,0 mm. The crystals diffract X-ray up to 3 A resolution 
and show high degree of mosaicity (-05-1.2°). They have the symmetry of space 
group P2 1 2 1 2 1 with cell constants of a= 120.62A, b=137.0lA, c=161.54A with 2 
pentamer molecules per asymmetric unit. Tetragonal crystals, squared in shape, 
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grew at lower CaCI 2 concentration, reaching 0.2x0.3x0,35 mm in size. The maximal 
resolution obtained was 2.7 A with a lower mosaicity (0.5°). They belong to space 
group P4 2 2i2 space group with cell dimensions of a=b=141.66A, c=120.83A with one 
pentamer molecule per asymmetric unit. The exact crystallization condition for the 

5 tetragonal crystal which was used for refinement of the crystal structure: 1 1 .5% (w/v) 
PEG 4000, 100 mM HE PES (pH 7.0), 150 mM CaCI 2 and 0.02% NaN 3 . The third 
crystal form, monoclinic P2i, is very similar in morphology to the orthorhombic 
crystals with cell dimensions of a=121.lA, b=162.lA, c=139.4A, [3=90.13°, containing 
4 pentamers per asymmetric unit. This crystals were gave lower resolution data (~ 

10 3.3A resolution). All three crystal forms were used in the structure determination of 
AChBP. 

The resolution limit of diffraction depended very much on the size of the crystals. And 
the largest crystals diffracted weakly to ~4A resolution when exposed to a 
conventional rotating anode X-ray source. Therefore, the use of synchrotron radiation 

1 5 was critical for the structure determination. The crystals had to be cryo-protected in 
order to slow down the damage caused by high intensity synchrotron radiation. The 
cryo-protection of the AChBP crystal was done in multiple steps. The first steps 
included the stabilization of crystal by adding the 2 pi of mother liquor (equilibrated 
reservoir solution) to the drop with the crystal. After 5 minutes 3 pi of stabilizing 

20 solution was added to the drop. Normally, the stabilizing solution contained slightly 
higher concentrations (1-5%) of the components of the original crystallization buffer. 
As protectant glycerol (Merck) was added, increasing the concentration stepwise 
from 0% to 30% (v/v). For example, the starting solution contained 15% PEG 4000, 
100 mM HEPES (pH 7.0), 150 mM CaCI 2 and 0.02% NaN 3 and the final solution 

25 contained 30% (v/v) glycerol in addition to the components just mentioned. The 
AChBP crystals do not tolerate drastic increase in the glycerol concentrations 
therefore a gentle but more time consuming approach has to be adopted. The 
solution around the crystal has to be stepwise exchanged (usually 5% increase of 
glycerol concentrations) allowing crystals to equilibrate for at least 5 minutes in each 

30 glycerol concentration. Once the crystals were equilibrated in stabilizing solution with 
30% glycerol they were flash-cooled in liquid nitrogen or in the cryo-stream. In all 
three space groups AChBP forms a decamer structure with perfect 52 symmetry, 
where two pentamers contact each other through a calcium-binding site, at the lop 1 
of the ct1 helix. This binding site (Asp2 and Asp5 from two monomers) is not 

35 conserved in the LGIC family. In the tetragonal space group the 2-fold of the 
decamer coincides with a crystallography two-fold, which leads to 
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pseudocentrosymmetric behavior of the phases at low resolution. In solution the 
AChBP protein acts as pentamer. 

Those of skill in the art will appreciate that the aforesaid crystallization conditions can 
5 be varied. Such variations may be used alone or in combination, and include final 
protein (optionally in complex with a ligand) concentrations between 1 mg/ml and 30 
mg/ml; all combinations of AChPB/ligand to precipitant ratios; use of citrate 
concentrations between 0 mM and 200 mM; DTT concentrations between 0 mM and 
10 mM; and any concentration of beta-mercaptoethanol; pH ranges between 5.5 and 
10 9.5; PEG concentrations between 5% and 25% (w/v); PEG weights between 2000 
and 8000; HEPES concentrations between 5 and 500 mM; use of TRIS or other 
solutions instead of HEPES, and any concentration or type of detergent; any other 
type of precipitating agent; any other buffer; any temperature between -50 fi C and 30 
9 C; and crystallization of AChBP or complexes thereof by batch, liquid bridge, or 
1 5 dialysis method using these conditions or variations thereof. 



EXAMPLE 7: Structure determination 

The crystal structure was determined using the multiwave anomalous dispersion 
(MAD) technique on a Pb derivative, but non-crystallographic symmetry (NCS) 

20 averaging was necessary to obtain interpretable electron density. Collection of native 
data and heavy-atom derivatives were carried out at the synchrotron beam-lines in 
Grenoble (ESRF/BM14 and ID14) and Hamburg (DESY/BW7A, BW7B and X1 1). The 
AChBP orthorhombic crystal was soaked in stabilizing solution containing 5 mM 
trimethylleadacetate (MePb) for 5 days. Data sets were collected at four different 

25 wavelengths (0.9492A, 0.8610A, 0.9507A and 0.9499A) and data were integrated 
and reduced using DENZO/SCALEPACK (Otwinowski and Minor (1997) Processing 
of X-ray diffraction data collected in oscillation mode. In Methods in Enzymology, 
Volume 276: Macromolecular Crystallography, part A. C.W. Carter and R.M. Sweet, 
eds. (New York: Academic Press), pp. 307-326). The program SOLVE (Terwilliger 

30 (1997) SOLVE: An automated structure solution for MAD and MIR. Edition 1.16) 
found 5 Pb sites which were situated on the interface between two pentamers. The 
Pb parameters were refined and phases calculated with SHARP (La Fortelle et al. 
(1997) Advances in MIR and MAD phasing: Maximum-likelihood refinement in a 
graphical environment, with SHARP. Proceedings of the CCP4 study weekend). 

35 Mean figure of merit (FOM) value for 4 wavelengths was 0.45. Search and 
optimization of 5-fold NCS operators were done using programs NCS6D and IMP 
(Kleywegt and Jones (1999) Software for handling macromolecular envelopes. Acta 
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Crystallo., D55, 941-944). 10-fold averaging using refined NCS operators in 
conjunction with density modification by DM (Cowtan (1994) DM: An automated 
procedure for phase improvement by density modification. In Joint CCP4 and ESF- 
EACBM Newsletter on Protein Crystallography 31, 34-38) yielded an interpretable 
5 electron density map. However, some parts of the pentamers were still not clearly 
defined. Therefore, a second MAD experiment was performed on the monoclinic 
crystals soaked in 10 mM MePb for 5 days. Data were collected for only two 
wavelengths, at the Pb peak (0.9479A) and remote (0.9498A) wavelength. The 
processing of the two collected data sets was done with MOSFLM (Leslie (1992) 

1 0 Recent changes to the MOSFLM package for processing film and image plate data. 
In Joint CCP4 and ESF-EACBM Newsletter on Protein Crystallography, Number 26) 
and data were scaled with SCALA (CCP4. The CCP4 suite: programs for protein 
crystallography. Acta Crystallog. D50, 760-763). 10 Pb sites were identified with 
Solve. The Pb parameters were refined and phases calculated with SHARP in single 

1 5 anomalous dispersion (SAD) mode using data collected at the Pb absorption peak 
The NCS operators needed for 20-fold averaging were found by NCS6D and 
improved with IMP. 20-fold averaging and density modification by program DM 
further improved electron density. The initial model tracing and sequence assignment 
were done based on the 20-fold averaged electron density with program O (Jones et 

20 al M 1991). However, parts of the molecules were not clearly defined. The electron 
density was further improved doing multi-crystal averaging with DMMULTI (Cowtan, 
1994) using amplitudes of tetragonal, orthorhombic and native data sets and 
experimental phases of the orthorhombic and monoclinic MAD experiments. Initially 
missing parts became clearly defined and a complete model could be built. The initial 

25 atomic model was refined with the program CNS (Brunger et al. (1998) Acta 
Crystallogr. D 54, 905-921) against a maximum-likelihood target without experimental 
phases contribution using tetragonal native data which extend to 2.7A resolution. 
Refinement included five-fold NCS restraints, an overall anisotropic B factor and bulk 
solvent correction. The five-fold NCS restraints were released for the parts of the 

30 pentamer that clearly do not follow the five-fold symmetry. The current model 
contains one pentamer of AChBP consisting of 1035 residues, 14 well-ordered 
solvent molecules, 5 Ca 2+ ions, 5 CI* ions and 5 Hepes molecules, well-ordered 
solvent molecules and 5 HEPES molecules. The following residues are not well 
defined in the electron density: -8-0 (part of cc-mating S. cerevtsiae signal sequence 

35 not native to AChBP EAEAYVEF; SEQ ID No. 21), 125-135, 155-165, 186-191 and 
206-210. 
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Electron density is detectable in the ligand-binding site of AChBP. It is presumed that 
a HEPES molecule could account for this extra electron density based on its 
chemical properties. HEPES or N-2-Hydroxyethylpiperazine-N'-2-ethanesulfonic acid 
contains a quaternary ammonium ion similar to ligand such as acetylcholine (ACh) 

5 and d-tubocurarine. It has been proposed that the binding of ACh would be mediated 
by cation-it interaction involving N + and it-systems of aromatic residues present in the 
binding site of nicotinic acetylcholine receptor. Without intending to be bound by 
theory it is suggested in accordance with the observation of the present invention that 
the observed HEPES molecule mimics ligand binding analogous to the binding of 

1 0 natural ligands like ACh in the ligand-binding site., 

EXAMPLE 8: More detailed description of the structure determined in Example 7 

As described in the previous example, the crystal structure of AChBP was solved 
using weak Pb MAD data in two crystal forms. The electron density map was 
15 improved substantially by cross-crystal averaging of three crystal forms with 20, 10 
and 5 copies of the monomer in the asymmetric unit respectively (Table 3). 



Table 1: Data collection statistics 

Data set kpeak Xo remote XJnfl. jg peak X 2 infl. Native 

20 Space group P2 1 2 1 2 1 P2i P^2 

Resol.(A) 3.3/3.4-3.3 3.0/3.1-3.0 2.7/2.8-2.7 

\(A) 0.9492 0.8610 0.9507 0.9479 0.9498 0.943 

Compl. (%) 99.7/99.7 99.6/99.6 99.7/99.7 99.9/99.9 99.5/99.5 97.8/96.5 

Mosaicity(°) 0.62 0.43 0.78 

25 Redundancy 3.7/3.8 3.8/3.9 3.7/3.8 3.5/2.2 3.2/2.0 6.5 

Emerge (%) 7.7/46.8 7.8/45.2 8.3/55.0 5.9/26.1 6.0/32.9 5.9/67.4 

l/al 8.7/1.6 8.4/1.7 8.3/1.4 7.7/2.7 6.8/1.5 27.4/2.3 
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ISO/ANO ISO/ANO ISO/ANO 



ISO/ANO ISO/ANO 



Ram (%) 



0.74/0.89 n.a./0.92 0.54/0.94 



n.ayo.74 0.66/0.77 



Phasing power 0.57/1 .2 n.a./1 .06 2.3/0.91 



n.a./1 .1 



0.37/1 .22 



FOM (overall) 0.45 



0.28 



10 



15 



20 



25 



30 



The structure was refined at 2,7 A in space group P4 2 2i2, with one AChBP pentamer 
in the asymmetric unit. Thus, native data (X1 1 ) were collected and the Pb-1 data sets 
(BW7A) at the EMBL/DESY synchrotron in Hamburg and the Pb-2 data sets (BM14) 
at the ESRF, Grenoble (Table 3). Data were processed with DENZO/SCALEPACK 
(Otwinowski & Minor, Methods Enzymol. 276 f 307-326 1997) (native) or MOSFLM 
(Leslie, Acta Crystallogr. D. Biol. Crystallogr. 55, 1696-1702, 1999)/SCALA (CCP4) 
(Pb-1, Pb-2). The Pb sites, located at the interface of two pentamers, were found for 
both MAD sets by SOLVE (Terwilliger, Acta Crystallogr. 55, 849-861, 1999) and 
heavy atom parameters were optimized with SHARP (La Fortelle et al., Methods 
Enzymol. 276, 472-494, 1997). NCS operators were found and refined with NCS6D 
and IMP (Kleywegt and Jones, SERC Daresbury Laboratory, Warrington, pp. 59-66, 
1994). DM-muIti (Cowtan, Joint CCP4 and ESF-EACBM Newsletter on Protein 
Crystallography, 31, 34-38, 1994) multi-crystal averaging used amplitudes of 
monoclinic, orthorhombic and native (tetragonal) data sets and experimental phases 
of the orthorhombic and monoclinic MAD experiments. The model was built in O 
(Jones et al., Acta Crystallogr. A47, 110-119, 1991) and refined with the program 
CNS (Brunger et al., Acta Crystallogr. D54, 905-921, 1998), against the tetragonal 
data to 2.7 A resolution. Refinement included partial 5-fold NCS restraints, an overall 
anisotropic B factor and bulk solvent correction. The unusual double cysteine 
Cys187-Cys188 formed a clear disulfide bridge. Because of the limited resolution it 
was refined with standard parameters. The final model contains 1025 residues of 
AChBP pentamer, 5 HEPES molecules, 10 Ca 2+ ions and 15 water molecules. The 
entire AChBP pentamer is well ordered, except for the N-terminal 7 residues (part of 
the signal sequence) and the last five C-terminal residues. In addition, the HEPES, 
the loop region 155-160 and the sugar residues attached to residue Asn66 are not 
well resolved in the electron density. R.m.s deviations from ideal geometry for bond 
distancesand angles are 0.01 A and 1.6°, respectively. The sequence alignment was 
calculated by CLUSTALX (Thompson et al., Nucleic. Acids. Res. 25, 4876-4882, 
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1997) and the corresponding figure with Espript (Gouet et al., Bioinformatics. 15, 
305-308, 1999). Figures 2-5 were done using programs MOLSCRIPT (Kraulis, PJ., 
J. Appl. Cryst. 24, 946-950, 1991), BOBSCRIPT (Esnouf, Acta Crystallogr. D55, 938- 
940, 1999) and RASTER3D (Merritt and Bacon, Methods Enzymol. 277, 505-524, 
5 1997). Refinement took place with partial five-fold NCS restraints, resulting in an R- 
factor of 26.4% (R free = 30%). 



The AChBP pentamer: 

The AChBP homopentamer, when viewed along the five-fold axis, resembles a 
10 windmill toy, with petal-like monomers (Fig. 6a). When viewed perpendicular to the 
five-fold axis it has a disc-like appearance (Fig. 6b). The overall proportions of the 
pentamer are -80x80x62 A, and the diameter of the central hole is -18 A. These 
dimensions are in good agreement with the Torpedo nAChR N-terminal domain EM 
data (Miyazawa et al., J. Mol. Biol. 288, 765-786, 1999). The only subunit contacts in 
1 5 the AChBP pentamer are dimer interfaces, of which each monomer has two, one 
called the plus side and one called the minus side. We refer to the A (plus)-B (minus) 
interface, as example for the five equivalent interfaces AB, BC, CD, DE and EA (Fig 
6). 

20 The AChBP monomer: 

Each AChBP monomer is a single domain protein, asymmetric in shape, with a size 
of -50x21x27 A (Fig. 12a). It consists of an N-terminal p-helix, two short 3i 0 helices 
and a core of 10 p-strands forming a p-sandwich. The order of p-strands conforms to 
a modified immunoglobulin (!g) topology (Fig. 12b) with an extra p-hairpin (f'-f") and 

25 an extra strand (b') (Bork et al., J. Mol. Biol. 242, 309-320, 1994). These additional 
strands introduce two so-called "Greek key" folding motifs. The Ig-based structure 
prediction (Le Novere et al., 1999; Corringer et al., Biophys. J. 76, 2329-2345, 1999) 
agrees well with the AChBP structure, although location of the binding site was 
missed due to the presence of extra p-strands (Fig. 12b). Compared to the classical 

30 Ig-fold, the AChBP p-strands are considerably twisted, with the p-sheets rotated 
against each other, resulting in two separate hydrophobic cores. Thus the three- 
dimensional fold does not resemble other Ig-like proteins and comparison to the 
protein database (Holm and Sander, Nucleic. Acids. Res. 25, 231-234, 1997) did not 
result in a significant match to any known structure. 



35 
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Positioning of functional regions: 

Couple of regions that are important to receptor function can be localized in the 
5 AChBP structure. In muscle type nAChRs the main immunogenic region (MIR), 
comprising residues at67-ai76, acts as an epitope in the autoimmune disease 
myasthenia gravis (Tzartos et al., Mol. Neurobiol. 5, 1-29, 1991). Although the Ml fi- 
liated region in AChBP (residues 65-72) shows no sequence homology to the ar 
subunit, its location in loop L3 at the top of the pentamer in a highly accessible 

10 position agrees well with the expected accessibility for this region. It also fits with EM 
studies that located the MIR at the distal end of the receptor relative to the 
membrane (Beroukhim and Unwin, Neuron 15, 323-331, 1995). 
On each AChBP monomer, a large cavity that is accessible from the central pore of 
the pentameric ring can be seen. The cavity is framed at the entrance by p-strands 

15 ({33, p4, P5 and p5') (Fig 12a) and is uncharged, mainly hydrophobic, in character. 
This region probably corresponds to the tunnel framed by twisted p-strands that was 
observed in the a r subunit of Torpedo receptor at 4.6 A resolution (Miyazawa et al., 
J. Mol. Biol. 288, 765-786, 1999). However, this cavity is not in contact with another 
large pocket observed at each interface between subunits. These latter pockets are 

20 lined by residues shown to be involved in ligand binding in nAChR (Arias, 
Neurochem. Int. 36, 595-645, 2000; Corringer et al., Annu. Rev. Pharmacol. Toxicol. 
40, 431-458, 2000). They are buried from the solvent, and located close to the 
outside of the pentameric ring. When viewed perpendicular to the five-fold axis they 
are roughly equatorially positioned, -30 A away from the C-termini (Fig. 6b), 

25 conforming to the expected location of the Torpedo receptor ligand-binding site, as 
determined by labeling (Fernando Valenzuela et al., Biophys. J. 66, 674-682, 1994) 
and EM studies (Unwin, J. Mol. Biol. 229, 1101-1124, 1993). 

The ligand-binding site: 

30 Each ligand-binding site is found in a cleft formed by a series of loops from the 
principal face of one subunit and a series of p-strands from the complementary face 
of an adjacent subunit. It is a large cavity buried by a series of loops from the 
principal side and by a p-strands from the complementary side (Fig. 13). The 
principal side on the plus side of the AB interface consists of residues coming from 

35 'loop A' (TyrA89), loop B' (TrpA143, A145) and loop C (TyrA185, the double 
cysteine A187-A1 88, and TyrA192) (Fig 13c). The complementary part of this binding 
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site is contributed by monomer B and made of loop D' (TrpB53, GlnB55), 'loop E" 
(ArgB104, ValB106, LeuB112 and MetB114) and loop F (TyrB164) (Fig 13d). In this 
pocket four of the aromatic residues form the bottom half of the cavity (TyrA89, 
TyrA185, TyrB164 and TrpB53). The pocket walls are formed by the TyrA192, 
TrpA143, main chain of A145 Met B114, the side-chain of GlnB55 and the double 
cysteine (CysA187-CysA188). The hydrophobic parts of ArgB104, ValB106 and 
LeuB1 12 form the top of the pocket (Fig 13a). 

All residues in the pocket had been successfully identified by photoaffinity labeling 
and mutagenesis studies (Arias, Neurochem. Int. 36, 595-645, 2000; Corringer et al., 
Annu. Rev. Pharmacol. Toxicol. 40, 2000). Although the side chain of HisA145 is 
pointing away from the cavity, its main chain is involved. One residue identified by 
labeling studies, TrpA82 (cdTrp86) (Galzi et al., J. Biol. Chem. 265, 10430-10437, 
1990; Dennis et al., Biochemistry 27, 2346-2357, 1988) is involved in hydrophobic 
core formation and located far from the pocket, thus not participating in ligand 
binding. Otherwise, the AChBP ligand-binding site confirms the available biochemical 
and mutational data on nAChR completely. 

The structure, however, shows for the first time how these residues are positioned 
with respect to each other and therefore provide a valuable tool for drug design as 
described in the above description of the present invention. 

All observed residues are conserved between known nicotinic ligand-binding 
subunits except the 'loop F TyrB164 residue. The loop F region has an unusual 
conformation, but since it is relatively weakly resolved, its precise analysis is difficult. 
The 'loop F region is stabilized in the structure by a calcium binding site formed by 
AspB16l, AspB175 and the main chain of B176. This Ca 2+ ion is structurally 
important for TyrB164 orientation and could therefore be important for proper ligand 
binding. The present findings are supported by labeling studies on muscle/Torpedo 
subunits. showing that residues homologous to AspB161, yAsp174/8Asp180 play a 
role in ligand binding (Czajkowski et al., Proc. Natl. Acad. Sci. U.S.A. 90, 6285-6289, 
1993; Czajkowski and Karlin, J. Biol. Chem. 270, 3160-3164, 1995; Martin et al„ J, 
Biol. Chem. 271, 13497-13503, 1996). Additionally, calcium binding sites that 
enhance the response to agonist binding have been identified in the homologous 
region (residue range 161-172) of neuronal a 7 receptor (Galzi et al., EMBO J. 15, 
5824-5832, 1 996). The 'loop F region has low sequence conservation in the nicotinic 
family (Fig. 11) and in other superfamily members it may well have a different 
conformation, even to the extent of forming a p-strand that connects the two sheets 
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into a p-barreL Such changes could well lead to variations in affinity, e.g. by changing 
the size of the ligand-binding site or its access route. 

The most likely access routes to the ligand binding sites are from above or below the 
5 double-cysteine-containing loop C (Fig. 13a). This region buries the pocket from the 
solvent and therefore prevents access from the outside. Access from the central pore 
has been suggested in the literature (Miyazawa, J. Mol. Biol. 288, 765-786, 1 999), 
but this would require major structural rearrangements at the interface, which makes 
it less likely. 

10 

Liaand binding: 

Surprisingly, features of bulky electron density were found that stacked onto Trp143 
in each ligand-binding site in the experimental cross-crystal averaged electron 
density (Fig 13b). Upon consideration we have assigned this to a HEPES (N-2- 

15 Hydroxyethylpiperazine-N'-2-ethanesulfonic acid) buffer molecule, that contains a 
positively charged quaternary ammonium group and therefore has some similarity to 
known nicotinic receptor ligands. Its EC50 is 100 mM, indicating that its binding 
under crystallization conditions (100-150 mM) is possible. Although HEPES molecule 
does not make any specific contacts with the protein, it stacks with its quaternary 

20 ammonium onto Trp143, making cation-it interactions as expected for nicotinic 
agonists (Dougherty, Science 271, 163-168, 1996) (Fig. 13b). However, due to 
limited resolution of the present data and probable low occupancy, the precise 
orientation of the HEPES molecule should be taken with some degree of reservation. 

25 It has been suggested (Changeux and Edelstein, Neuron 21 , 959-980, 1998) that the 
ligand-binding site of nAChRs could be similar to that of acetylcholinesterase (AChE). 
Although the size of the binding site is roughly similar in AChBP and AChE, the 
observed arrangement of aromatic residues is quite different. However, the stacking 
of the quaternary ammonium of HEPES, as far as it has been refined in the current 

30 AChBP structure, is similar to that of the quaternary ammonium of the 
decamezonium in AChE on the Trp84 residue (Harel et al., Proc. Natl. Acad. Sci. 
U.S.A. 90, 9031-9035, 1993). 

Subunit arrangement: 

35 From the location of the ligand-binding site conclusions can be drawn about the 
relative arrangement of subunits in the Torpedo and muscle receptors. It has been 
suggested that the ctiy and oti5 interfaces occur in a clockwise aiyai8Pi arrangement 
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when looking towards the membrane (Machold et al., Eur. J. Biochem. 234, 427-430, 
1995). Such a clockwise arrangement disagrees with the structure determined in 
accordance with the present invention, because the relative arrangement of the 
principal binding site and its complementary partner is anticlockwise when looking 
5 towards the 'bottom' (membrane) side of the pentamer (Fig. 6). 



Pentamer interface: 

The subunit interface consists on the plus side entirely of loop regions (L1 , L2, L4, 

10 L5, L7, L8 and L10), whereas the minus side mostly presents secondary structure 
elements to the interface (a1, pi, p2, p3, (35, p6 and L9) (Fig. 14), Several residues 
are important for both ligand-binding and pentamer formation. The interface buries a 
considerably surface area (2700 A 2 ), with a mainly uncharged character including 
only a single bifurcated salt bridge (GluA149-ArgB3 and ArgB104). Most intriguing 

1 5 about the interface residues is the lack of conservation of these particular residues in 
the entire superfamily, not only with AChBP, but also amongst each other (Fig. 11). 
These changes involve major changes in character, including changes from 
hydrophobic to charged. Even when a residue is conserved in any particular subunit, 
its expected counterpart is missing (either in the same subunit, as in the a 7 

20 homopentamers, or in contacts such as muscle ai 6 or a-jy or neuronal CC4P2) with the 
sole exception of the ligand-binding site. The high level of structural conservation 
however, determines involvement of the same topological regions in these contacts 
in all family members (Fig 14b). This indicates that shape complementarity must play 
a major role in determining the conservation of the pentamer structure. It also 

25 indicates that different combinations of subunits will have different interfaces, 
creating variations in the precise allosteric contacts and movements in the various 
subclasses of these ion channels. 

Liaand-gated ion channels: 

30 The lack of conservation of the interface residues seems a general feature in the 
superfamily of LGICs, as the residues that form the interface are among the least 
conserved regions of the domains (Fig. 11). Apparently pentamer formation does not 
impose very stringent evolutionary requirements in this case. However, there is clear 
sequence conservation within the superfamily (Fig. 11) and it is interesting to analyze 

35 this in the light of the structure. 
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In the AChBP monomer structure the conserved hydrophobic residues can be 
grouped into three clusters (Fig. 15). In AChBP, as in other proteins with Ig-like fold, 
the packing of the p-sheets is promoted mainly by hydrophobic and to a lesser extent 
by electrostatic interactions. The first cluster is involved in packing of the N-termlnal 
5 helix a1 against the main framework of the monomer and it involves residues 6, 10, 
63, 65, 71 , 81 , 105 and 111. The second cluster, comprising residues 20, 27, 29, 31 , 
58, 82, 84, 86, 140, 150, 152 and 195, is situated in the upper half of the p-core 
region. The third cluster, including residues 33, 35, 38, 41, 48, 52, 125, 138, 171, 
173, 199 and 201, is located at the lower end of the structure (Fig 15). The only non- 

1 0 hydrophobic residues that are highly conserved in the superfamily are Asp60, Asp85, 
Asn90 and Gly109. Asp60 and Gly109 are involved stabilizing the turns of a Greek 
key motif connecting strands P3, p5, P6 and p2, where Asp60 stabilizes the N- 
terminus of a small 3 10 helix and Gly109 enables tight turn formation. Conserved 
residues Asp85 and Asn90 are involved in packing of the p-sheets. Asp85 forms 

15 hydrogen bonds to the highly conserved Ser142 and Thr144 and residue Asn90 
brings together the main-chain oxygens of Ser122 and Arg137, enabling disulfide 
bond formation of the nearby absolutely conserved disulfide bond (123-136). This 
disulphide bond is topologically equivalent to so-called 'tyrosine cornerstone 1 
(Hemmingsen et al., Protein Sci. 3, 1927-1937, 1994), which links the two p-sheets 

20 together in Ig-like proteins. This explains why in the Torpedo receptor the Cys128- 
Cys142 bond is important for both preservation of subunit conformational stability 
(Mishina et al., Nature 313, 364-369, 1985) and complete nAChR assembly (Green & 
Wanamaker, J. Neurosci. 18, 5555-5564, 1998). Since the observed overall 
structural conservation is high, it is clear that all LGIC N-terminal domains will have 

25 the same three-dimensional structure. 

In contrast to the above residues, the Cys-loop is a highly conserved hydrophobic 
region in the LGIC family but presents a totally different character in AChBP (Fig, 11). 

In AChBP, this loop is hydrophilic and is found at the bottom (membrane) side of the 
30 protein, at the dimer interface. This location and its hydrophobicity in the LGIC family 
implies that this loop could interact with the membrane or with the transmembrane 
region of the receptors, functions that are absent in AChBP. 

Since all ligand gated ion channels have intrinsically the same function, opening of a 
35 membrane pore, it is likely that the conserved regions of the protein determine this 
function. That also indicates that it is unlikely that the interface of the pentamer has a 
major role in opening the channel. It is possible that the conserved Cys-loop is 
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directly involved in transmitting this kind of information to the membrane part of the 
LGICs. Another option is that large structural changes in the p-sheet regions play a 
role in opening the channel. Indeed, the movement observed at 9 A for Torpedo 
nAChR upon agonist binding (Unwin, Nature 373, 37-43, 1995), fits well with such a 

5 suggestion. In accordance with the present invention a twisted p-sandwich would be 
observed, with two distinct hydrophobic cores and it is entirely possible that these 
cores move with respect to each other upon ligand binding. The effect of such 
movements will then be modulated by the varying subunit interfaces in the different 
subtypes of the receptor, allowing intricate specificity in the neuronal signal 

10 transmission. 

EXAMPLE 9: Ligand-binding crystallization studies 

AChBP was cocrystallized in complex with a-bungarotoxin (aBTX, Sigma). Prior to 
the crystallization experiments the stability of the complex has been investigated. 

15 Using gel-filtration chromatography (Superdex 200 HR 10/30, Pharmacia, column 
volume 24 ml) it has been found that it is possible to purify stable complex between 
AChBP and aBTX. The gel-filtration run was performed using 20 mM Tris (pH 8.0), 
150 mM NaCI and 0.02% NaN 3 . The stability of the complex was also confirmed with 
native PAGE. The crystallization experiments were done based on the same set of 

20 conditions found to work for AChBP alone; see Example 6. A small screen was set 
up with different precipitant concentrations and various AChBP:a BTX 
concentrations. Tiny crystals appeared in the conditions containing 10-12% PEG 
4000, 100 mM HEPES (pH 7.0), 20-80 mM CaCI 2 and 0.02% NaN 3 . The best looking 
crystals grew under above mentioned conditions when AChBP: (BTX were mixed in 

25 1:10 molar ratio. In order to check if complex indeed crystallized, crystals were 
thoroughly washed, dissolved in denaturing buffer and checked on SDS-PAGE that 
clearly showed that they contained both proteins. 

In addition, a number of small ligands were bound to AChBP in soaking experiments. 

These include: B-bippinatin (a synthetic analog of lophotoxin), acetylcholine (ACh, 
30 Sigma), d-tubocurarine chloride (Sigma), carbamylcholine chloride (CCh, Sigma), 

galanthamine hydrobromide (Sigma), epibatidine (Sigma) and nicotine (Sigma). The 

soaking solutions were made of stabilizing solutions (see Example 6) and together 

with dissolved ligands (ligands were normally dissolved in 20 mM HEPES [pH 7.0]). 

The ligand concentrations used were dependent on its binding constants, as 
35 determined by ligand-binding studies. The soaking times were different depending on 

the ligand used. After the soaking step the crystals were flash-cooled in liquid 

nitrogen. 
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EXAMPLE 10: Generating human alpha7 nAChR / AChBP chimeras 

The chimeric proteins of nAChR subunits and AChBP can be used as tools in the 
development of novel, nAChR subtype specific ligands. As a first step in developing 

5 these tools chimeric proteins have been designed and constructed in which part(s) of 
the human alpha7 nAChR were grafted into AChBP. Previous studies on the 
molecular determinants of ligand-binding by the alpha7 nAChR have identified three 
amino acid domains that compose the primary part of the ligand-binding site, further 
referred to as "loops A, B, and C'\ Within each of the three loops amino acid residues 

10 are present that are thought to directly interact with the ligand. Based on sequence 
conservation of the nAChR and AChBP the three possible ligand-binding loops of 
AChBP have been pin-pointed in accordance with the present invention as follows: 
loop A, Trp-101 -> Tyr-108; loop B, Trp-162 -> His-164; loop C, Tyr-204 ~> Tyr-211. 
The chimeric proteins that were constructed replace either one (A, B or C) or multiple 

15 (A&B, A&C, B&C and A&B&C) of the ligand-binding loops of AChBP with the 
corresponding human alpha7 nAChR sequence. 

The loop-A domain of AChBP was replaced by the corresponding domain of the 
human alpha7 nAChR using a two-step polymerase chain reaction (PCR). In the first 
step two separate PCR amplifications (35 cycles: 94 2 C;30 sec., 58 S C; 30 sec and 

20 72 S C; 60 sec.) yielded two halves of the chimera construct. AChBP cDNA (wild type) 
was used as template, and outer primers located either just before the start codon 
(gcgctcgagaaaagagaggctgaagctttggaccgggcagacatctt; SEQ ID No. 17) or just before 
the stop codon (cgcgaattcaagaatttcggagcgtccctt; SEQ ID No. 18) were each used in 
combination with two internal primers gtggaaaccagacattctcctctacaacgccatctcgaaacc 

25 (SEQ ID No. 19) and gaggagaatgtctggtttccacaaagagcttattggcac (SEQ ID No, 20), 
respectively. The internal primers contained a 5-tag-sequence that encoded for the 
introduced alpha7 nAChR domain. As such the two generated chimeric PCR 
products share a common tag containing a part of the alpha7 nAChR subunit. In the 
second step, the two PCR products from the first round were pooled and, in the 

30 absence of primers, went through 5 rounds of PCR amplification (94 S C; 30 sec, 54 
C C; 3 mm. and 72 2 C; 90 sec). This allowed the two halves of the chimera to anneal 
to each other at the common alpha7 nAChR tag. The subsequent addition of the two 
outer primers and another 35 cycles of 25 PCR amplification (94 e C; 30 sec, 58 2 C; 
30 sec. and 72 S C; 90 sec.) yielded the final chimera construct. All PCR 

35 amplifications were hot-started and performed using PFU DNA-polymerase 
(Invitrogen). The loop-A AChBP/alpha7 chimera was cloned, using Xhol/EcoRI 
restriction sites in the outer primers, into the His-tag containing yeast expression 
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vector pPIC9 (Invitrogen). Validation of the construct was achieved by DNA 
sequencing. Expression of the chimera construct was achieved according to the 
Pichia pastoris protein expression protocol of Invitrogen. 



5 As described in the examples and the description, the present invention provides 
water-soluble ligand-binding proteins derived from molluscs and analogs of ligand- 
gated ion channels, crystals thereof and their use for screening ligands of ligand- 
gated ion channels. In particular, ligand-binding proteins have been identified that are 
capable of forming multimers and are amenable to crystallization. The crystal! 

1 0 structure of one these proteins, an acetylcholine binding protein (AChBP) is provided, 
which can be used to generate 3D models of the extracellular ligand-binding domain 
of ligand-gated ion channels and thus for screening of drugs that act on these ion 
channels. Furthermore, chimeric proteins are provided that are capable of binding a 
ligand of a ligand-gated receptor, and comprising at least the amino acids of the 

15 AChBP determining solubility of the AChBP, in the same positions as in the AChBP, 
and furthermore comprising amino acids determining binding to said ligand. 
It will be clear that the invention may be practiced otherwise than as particularly 
described in the foregoing description and examples. Numerous modifications and 
variations of the present invention are possible in light of the above teachings and, 

20 therefore, are within the scope of the appended claims. 

The entire disclosure of each document cited (including patents, patent applications, 
journal articles, abstracts, laboratory manuals, books, or other disclosures) in the 
Background of the Invention, Description, and Examples is hereby incorporated 
herein by reference. Moreover, the sequence listing is herein incorporated by 

25 reference. 
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19.169 30.638 22.746 1.00 50.09 7 
18.925 29.997 24.018 1.00 53.68 6 
18.587 28.521 23.817 1.00 53.36 6 
19.653 27.845 23.180 1.00 57.88 8 
20.201 30.150 24.823 1.00 56.00 6 
21.297 29.933 24.306 1.00 57.56 8 
20.049 30.542 26.084 1.00 57.69 7 
21.175 30.752 26.980 1.00 57.19 6 
21.227 32.198 27.460 1.00 57.32 6 
22.536 32.449 28.185 1.00.58.47 6 
21.044 33.147 26.288 1.00 56.64 6 
21.016 29.878 28.204 1.00 57.98 6 
19.938 29.815 28.787 1.00 59.53 8 
22.101 29.232 28.611 1.00 58.26 7 
22.069 28.356 29.765 1.00 58.30 6 
21.806 26.914 29.298 1.00 60.26 6 
21.881 25.975 30.361 1.00 61.39 8 
23.374 28.414 30.530 1.00 58.22 6 
24.402 27.983 30.024 1.00 62.28 8 
23.340 28.937 31.753 1.00 56.90 7 
24.548 29.002 32.572 1.00 56.09 6 
24.489 30.183 33.541 1.00 54.84 6 
24.257 31.555 32.914 1.00 55.33 6 
24.483 32.647 33.962 1.00 53.97 6 
25.201 31.731 31.737 1.00 56.27 6 
24.725 27.728 33.379 1.00 55.50 6 
23.770 27.220 33.950 1.00 56.54 8 
25.948 27.215 33.413 1.00 55.65 7 
26.270 26.018 34.183 1.00 56.27 6 
26.815 24.905 33.279 1,00 60.64 6 
25.908 24.528 32.102 1.00 66.07 6 
24.552 23.965 32.566 1.00 72.18 6 
23.611 23.670 31.373 1.00 74.46 6 
22.303 23.068 31.799 1.00-74.04 7 
27.365 26.493 35.108 1.00 54.54 6 
28.463 26.811 34.655 1.00 55.93 8 
27.079 26.564 36.401 1.00 52.21 7 
28.086 27.045 37.336 1.00 49.54 6 
27.422 27.491 38.633 1.00 46.23 6 
26.545 28.682 38.450 1.00 47.55 6 
25.230 28.536 38.035 1.00 48.89 6 
27.056 29.968 38.603 1.00 48.56 6 
24,434 29.653 37.771 1.00 47.34 6 
26.269 31.087 38.343 1.00 45.87 6 
24.958 30.927 37.926 1.00 47.81 6 
29.224 26.072 37.595 1.00 49.37 6 
29.020 24.880 37.833 1.00 48.82 8 
30.434 26.609 37.530 1.00 47.17 7 
31.634 25.820 37.706 1.00 45.02 6 
32.641 26.106 36.574 1.00 42.09 6 
33.858 25.230 36.717 1.00 38.48 6 
31.966 25.897 35.224 1.00 41.05 6 
31.393 24.520 35.028 1.00 41.45 6 
32.284 26.141 39.029 1.00 45.51 6 
32.977 25.304 39.603 1.00 45.83 8 
32.068 27.351 39.522 1.00 45.06 7 
32.678 27.-719 40.792 1.00 45.81 6 
34.200 27.712 40.652 1.00 43.65 6 
34.900 27.384 41.950 1.00 47.86 6 
34.518 27.864 43.025 1.00 49.43 8 
35.942 26.572 41.862 1.00 46.99 7 
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17.678 
16.879 
16.999 
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20.618 
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1.00 55.13 
1.00 56.72 
1.00 58.46 
1.00 59.64 
1.00 60.59 
1.00 61.80 
1.00 63.04 
1.00 58.38 
1.00 58.33 
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ATOM 4355 CA ARG C 137 

ATOM 4356 CB ARG C 137 

ATOM 4357 CG ARG C 137 

ATOM 4358 CD ARG C 137 

ATOM 4359 NE ARG C 137 

ATOM 43 60 CZ ARG C 137 

ATOM 4361 NH1 ARG C 137 

ATOM 43 62 NH2 ARG C 137 

ATOM 43 63 C ARG C 137 

ATOM 4364 O ARG C 137 

ATOM 4365 N ILE C 138 

ATOM 4366 CA ILE C 138 

ATOM 4367 CB ILE C 13 8 

ATOM 4368 CG2 ILE C 138 

ATOM 4369 CGI ILE C 138 

ATOM 4370 CDl ILE C 138 

ATOM 4371 • C ILE C 13 8 

ATOM 4372 O ILE C 13 8 

ATOM 4373 N LYS C 139 

ATOM 4374 CA LYS C 13 9 

ATOM 4375 CB LYS C 139 

ATOM 437 6 CG LYS C 139 

ATOM 4377 CD LYS C 139 

ATOM 437 8 CE LYS C 139 

ATOM 4379 NZ LYS C 139 

ATOM 4380 C LYS C 139 

ATOM 4381 O LYS C 139 
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22.642 64.580 59.335 1.00 86.47 6 
22,862 65.788 59.024 1.00 87.43 8 
23.168 63.998 60.320 1.00 85.12 8 
21.207 63.185 54.715 1.00 77.86 6 
20.460 62.560 53.955 1.00 76.34 8 
21.771 64.355 54.428 1.00 76.46 7 
21.652 64.995 53.136 1.00 75.54 6 
21.941 66.495 53.290 1.00 76.21 6 
23.233 66.719 53.841 1.00 78.77 8 
22.673 64.318 52.205 1.00 74.37 6 
22.799 64.675 51.026 1.00 74.56 8 
23.392 63.338 52.764 1.00 72.74 7 
24.389 62.591 52.019 1.00 70.01 6 
25.435 63.449 51.337 1.00 68.98 6 
25.636 64.615 51.686 1.00 69.00 8 
26.107 62.859 50.355 1.00 66.70 7 
27.131 63.563 49.609 1.00 64.10 6 
28.394 62.723 49.531 1.00 63.57 6 
26.641 63.899 48.212 1.00 62.41 6 
25.640 63.360 47.737 1.00 60.16 8 
27.347 64.826 47.573 1.00 62.33 7 
27.023 65.237 46.211 1.00 62.11 6 
26.431 66.642 46.166 1.00 61.79 6 
25.253 66.675 46.980 1.00 65.07 8 
26.057 67.007 44.746 1.00 60.95 6 
28.292 65.181 45.375 1.00 60.85 6 
29.181 66.040 45.473 1.00 61.27 8 
28.368 64.128 44.574 1.00 58.77 7 
29.499 63.891 43.712 1.00 56.58 6 
29.140 64.393 42.325 1.00 56.26 6 
28.197 63.907 41.710 1.00 56.21 8 
29.794 62.396 43.698 1.00 54.70 6 
31.010 61.882 42.454 1.00 52.66 16 
29.874 65.386 41.843 1.00 55.40 7 
29.605 65.938 40.520 1.00 55.61 6 
29.698 67.466 40.537 1.00 56.53 6 
28.713 68.135 41.462 1.00 61.72 6 
29.231 69.491 41.947 1.00 65.19 6 
28.632 69.871 43.236 1.00 69.78 7 
27.352 70.221 43.412 1.00 71.88 6 
26.504 70.256 42.384 1.00 74.24 7 
26.908 70.522 44.626 1.00 70.49 7 
30.604 65.392 39.522 1.00 55.23 6 
31.807 65.381 39.773 1.00 57.26 8 
30.095 64.948 38.385 1.00 52.01 7 
30.922 64.398 37.333 1.00 50.01 6 
30.529 62.928 37.061 1.00 49.70 6 
31.361 62.360 35.933 1.00 47.04 6 
30.703 62.090 38.327 1.00 48.47 6 
30.080 60.706 38.225 1.00 46.09 6 
30.693 65.222 36.070 1.00 48.82 6 
29.571 65.322 35.597 1.00 46.40 8 
•31.752 65.814 35.529 1.00-49.18 7 
31.634 66.614 34.309 1.00 52.75 6 
32.364 67.950 34.464 1.00 54.78 6 
31.952 68.737 35.697 1.00 59.35 6 
32.477 70.168 35.669 1.00 60.96 6 
31.780 71.019 34.594 1.00 62.06 6 
32.316 72.419 34.559 1.00 59.27 7 
32.218 65.886 33.106 1.00 52.07 6 
33.364 65.446 33.143 1.00 52.64 8 
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ATOM 4483 CB VAL C 152 

ATOM 4484 CGI VAL C 152 

ATOM 4485 CG2 VAL C 152 

ATOM 4486 C VAL C 152 

ATOM 4487 O VAL C 152 

ATOM 4488 N ASP C 153 

ATOM 4489 CA ASP C 153 

ATOM 4490 CB ASP C 153 

ATOM 4491 CG ASP C 153 

ATOM 4492 ODl ASP C 153 

ATOM 4493 OD2 ASP C 153 

ATOM 4494 C ASP C 153 

ATOM 4495 O ASP C 153 

ATOM 4496 N PRO C 154 

ATOM 4497 CD PRO C 154 

ATOM 4498 CA PRO C 154 

ATOM 4499 CB PRO C 154 

ATOM 4500 CG PRO C 154 

ATOM 4501 C PRO C 154 

ATOM 4502 O PRO C 154 

ATOM 4503 N THR C 155 
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31.055 75.610 21.978 1.00 57.36 6 

32.017 76.635 21.828 1.00 59.18 8 

30.972 74.660 19.682 1.00 57.72 6 

29.790 74.484 19.375 1.00 57.25 8 

31.885 75.065 18.809 1.00 58.23 7 

31.517 75.336 17.424 1.00 60.12 6 

32.555 76.264 16.777 1.00 63.75 6 

32.799 77.567 17.549 1.00 70.83 6 

33.950 78.393 16.946 1.00 77.31 6 

34.422 79.453 17.852 1.00 84.18 7 

33.696 80.506 18.245 1.0086. 23 6 

32.447 80.661 17.818 1.00 87.64 7 

34.213 81.408 19.072 1.00 86.26 7 

31.390 74.051 16.601 1.00 59.83 6 

30.786 74.053 15.519 1.00 60.30 8 

31.954 72.959 17.121 1.00 57.69 7 

31.937 71.677 16.425 1.00 54.67 6 

33.364 71.132 16.321 1.00 52.47 6 

34.395 72.228 16.050 1,00 52.11 6 

35.824 71.718 15.896 1.00 50.20 6 

36.246 70.852 16.678 1.00 49.43 8 

36.537 72.203 15.004 1.00 47.62 8 

31.043 70.698 17.162 1.00 53.64 6 

30.252 69.985 16.552 1.00 53.16 8 

31.172 70.667 18.479 1.00 52.16 7 

30.353 69.795 19.289 1.00 51.98 6 

31.157 68.612 19.883 1.00 53.49 6 

30.361 67.954 21.019 1.00 52.00 6 

31.450 67.571 18.800 1.00 53,48 6 

32.235 66.383 19.299 1.00 50.44 6 

29.750 70.565 20.446 1.00 53.26 6 

30.410 71.389 21.095 1.00 51.01 8 

28.479 70.280 20.694 1.00 54.99 7 

27.749 70.887 21.797 1.00 56.94 6 

26.693 71.873 21.280 1.00 54.85 6 

25.665 71.221 20.557 1.00 55.92 8 

27.084 69.717 22.516 1.00 59.00 6 

26.536 68.820 21.871 1.00 60.03 8 

27.172 69.705 23.844 1.00 60.32 7 

26.569 68.651 24.655 1.00 61.16 6 

27.564 68.096 25.694 1.00 60.14 6 

28.858 67.723 25.009 1.00 61.98 6 

27.817 69.110 26.775 1.00 61.23 6 

25,373 69.237 25.391 1.00 62.82 6 

25.379 70.409 25.758 1.00 63.73 8 

24.349 68.425 25.614 1.00 65.81 7 

23.147 68.887 26.293 1.00 67.23 6 
22.150 69.377 25.249 1.00 69.19 6 
22,748 70.425 24.320 1.00 72.38 6 
22.786 71.614 24.718 1.00 72.90 8 
23.193 70.060 23.201 1.00 75.01 8 
22.505 67.776 27.120 1.00 69.10 6 
•22.504 66.612 26.717 1.00 68.47 8 
21.970 68.115 28.304 1.00 70.49 7 
22,282 69.309 29.094 1.00 69.97 6 
21.325 67.098 29.147 1.00 73.18 6 

21.148 67.814 30.477 1.00 70.84 6 
22.300 68.745 30.493 1.00 71.35 6 
19.985 66.669 28.522 1.00 77.25 6 
.19.591 67.183 27.462 1.00 77 .76 8 
19.279 65.744 29.175 1.00 80.57 7 
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56.896 

56.663 

58.756 

58.391 

59.972 

60.943 

61.916 

61.213 

62.264 

60.303 

61.680 

61.734 

62.223 

62.951 

64.334 

65.160 

64.650 

66.338 

63 .084 

63.031 

63 .237 

63.391 

62.109 

62.345 

60.943 



48.699 

48.621 
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50.438 

51.195 

51.024 

51.424 

50.597 

49.326 

51.039 

50.109 

51.215 

48 .968 

48.982 

48.324 

48.814 

48.400 

49.724 

48.890 

50.218 

49.799 

48.311 

47.761 

48.376 
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49.981 
50.992 
50.557 
52.216 
47 ,162 
47.600 
46.154 
45.494 
43 .970 
43.334 
43.384 
41.902 
45.977 
46.021 
46.321 
46.782 
47.756 
48.856 
49.723 
49.684 
45.564 
45.329 
44.768 
43.576 
43.991 
42 .812 
42.011 
42.695 
42.593 
42 .980 
41.318 
40.267 
39.401 
38.232 
40.254 



1.00 60.78 
1.00 59.44 
1.00 58.39 
1.00 59.12 
1.00 61.61 
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1.00 66.40 
1.00 68.28 
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1.00 68.62 
1.00 59.25 
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.00 53.29 
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1.00 64.82 
1.00 65.38 
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,00 58.96 
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1.00 59.99 
1.00 57.81 
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1.00 62.65 
1.00 62.58 
1.00 63.64 
1.00 63.25 
1.00 61.62 
1.00 64.25 
1.00 65.71 
1.00 65.23 
1.00 66.02 
1.00 67.24 
1.00 68.54 
1.00 69.53 
1.00 66.59 
1.00 66.29 
1.00 66.20 
1.00 66.07 
1.00 65.83 
1.00 66.59 
1.00 64,35 
1.00 63.70 
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19.869 
18.715 
20.786 
20.482 
20.215 
21.310 
18.932 
21.640 
22.802 
21.312 
22.320 
22 .150 
21.560 
21.896 

21,745 

22 .346 
22.149 
21.070 
23 .214 
23.166 

23 .205 
22 .291 
22.499 
21.814 
22.363 
24.384 
25 .504 
24.180 
25.306 
24.833 
24.008 
24.908 
25.645 
24.768 
25.971 
25.552 
27.027 
27.698 
27.967 
28. 580 
29.508 
28.071 
28.977 
29.937 
28.978 
30.134 
29.726 
30.731 
31.230 
30.941 
32.483 
33.613 
33.872 
33.648 
35.292 
34.925 
35.305 
35.616 
36.892 
36.995 
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64.532 
64.589 
65.447 
66.573 
67.861 
68.109 
67.718 
66,813 
66.641 
67.207 
67.470 
66.500 
65.148 
64.046 
64.237 
62.889 
68.884 
69.244 
69.675 
71.054 
72.022 
71.610 
72.459 
71.821 
70.452 
71.301 
71.353 . 
71.466 
71.719 
71.667 
72.846 
73.920 
73 .452 
73.295 
73.075 
73.803 
73.399 
74.674 
74.948 
73.770 
73.137 
73 .481 
74.884 
74.153 
75.911 
76.248 
77.290 
77.477 
76.798 
77.272 
76.731 
77.211 
76.264 
74.843 
76.442 
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78.485 
78.722 
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28.233 
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26.214 
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30.231 
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30.696 
30.277 
31.209 
28.728 
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1.00 67.64 
1.00 68.63 
1.00 69.48 
1.00 69.26 
1.00 68.92 
1.00 68.40 
1.00 70.44 
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1.00 73.43 
1.00 76.20 
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00 84.40 
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00 72.16 
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00 79.07 
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1.00 63.90 
1.00 63.70 
1.00 63.89 
1.00 62.80 
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38,370 
38.032 
37 .920 
39.118 
40.236 
40.555 
39.351 
38.363 
37.224 
39.164 
38.017 
37 .057 
35.920 
41.426 
41.625 
42 .220 
43 .405 
44.183 
44.398 
44 .300 
44.926 
44.332 
45.116 
44.658 
45.443 
44.899 
43.205 
43.345 
42.727 
41.779 
40.785 
42.037 
42.984 
42.070 
43 .222 
41.378 
41.921 
43 .328 
39.835 
39.233 
39.201 
37.734 
37.305 
38.277 
38.082 
36.908 
39.106 
37.084 
37.762 
35.764 
35.035 
33.755 
■34.702 
34.423 
34.717 
34.409 
35.621 
36.737 
37.694 
38.680 
36.800 



80.406 

78.470 

78.802 

77.869 

77.572 

76.067 
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74.258 

74.597 

73.839 

73 .678 
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78.869 

79.653 
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76.648 

76.703 

76.687 
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74.462 

76.774 

76.642 

77.755 

78.065 

75.269 

73 .940 

78.346 

77.849 

79.431 

79.283 

78.858 

79.468 
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78.404 

77.497 

77.434 
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77.889 

77.596 
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77.281 

78.075 

75.796 
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73.887 
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200 22.285 64.232 

200 21.985 64.387 

200 22.905 64.419 

200 20.700 64.480 

200 22.898 61.801 

200 21.821 61.316 

201 23.739 61.181 
201 23.454 59.867 
201 24.169 58.807 
201 25.663 58.820 
201 26.312 57.763 
201 26.413 59.912 
201 27.689 57.794 
201 27.782 59.955 
201 28.425 58.889 
201 23.979 59.812 

201 24.640 60.739 

202 23.698 58.702 
202 24.140 58.511 
202 23.192 59.224 
202 21.844 58.529 
202 20.831 59^.278 
202 19.555 58.567 
202 18.430 59.062 
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202 17.315 58.349 
202 24.116 57.023 

202 23.439 56.231 
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204 21.742 53.460 

204 22.711 52.854 

205 20.811 54.022 
205 20.898 53.921 
205 19.797 53.078 
205 18.911 52.614 
205 19.811 52.879 

1 39.182 71.754 

1 40.239 71.385 

1 40.122 70.241 

1 41.397 72.176 

1 41.141 69.886 

1 42.418 71.835 

1 42.289 70.688 

1 37.071 70.999 

1 37.607 71.392 

1 37.010 72.284 
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2 35.915 70.338 
2 35.253 70.061 
2 33,949 69.266 
2 34.138 68.032 
2 35.029 67.218 
2 33.396 67.886 
2 36.181 69.310 

2 37.378 69.165 

3 35.639 68.837 
3 36.461 68.128 
3 35.748 68.078 
3 36.068 69.254 
3 35.185 69.243 
3 35.849 69.838 
3 36.028 71.141 
3 35.591 72.031 
3 36.664 71.553 
3 36.831 66.719 

3 37.938 66.252 

4 35.909 66.050 
4 36.153 64.699 
4 34.938 64.175 
4 37.347 64.696 
4 38.225 63.851 
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I. 321 1.00 69.54 6 
2.678 1.00 67.84 7 
3.967 1.00 66.03 6 
3.772 1.00 66.47 6 
2.928 1.00 68.89 6 
3,287 1.00 68.16 8 
1.912 1.00 70.47 8 
4.933 1.00 64.63 6 
4.672 1.00 64.43 8 
6.049 1.00 62.26 7 
7.029 1.00 60.44 6 
8.388 1.00 60.90 6 
9.302 1.00 60.80 6 

10.532 1.00 65.18 6 

II. 696 1,00 66.90 7 
11.888 1.00 66.63 6 
11.002 1.00 68.16 7 
12.964 1.00 66.33 7 

6.580 1.00 59.10 6 
6.845 1.00 57.28 8 
5.891 1.00 56.67 7 
5.414 1.00 53.48 6 
4.706 1.00 52.26 6 
4.479 1.00 53.39 6 
4.600 1.00 52.40 8 
3.550 1.00 54.53 7 
2.602 1.00 55.71 6 
1.627 1.00 58.22 6 
0.881 1.00 61.46 6 
0.386 1.00 64.32 8 
0.773 1.00 63.22 8 
3.326 1.00 54.28 6 
2.914 1.00 52.86 8 
4.410 1.00 52.53 7 
5.154 1.00 53.69 6 
6.191 1.00 55.79 6 
6.877 1.00 54.61 6 
5.499 1.00 55.83 6 

6.400 1.00 55.01 6 
5 .870 1.00 52.43 6 
5.709 1.00 52.78 8 
6.666 1.00 51.01 7 

7.401 1.00 50.01 6 
8.276 1.00 47.85 6 
9.408 1.00 46.02 6 

10.010 1.00 48.59 6 
10.440 1.00 45.91 6 
6.415 1.00 51.80 6 
6. 631 1.00 52.35 8 
5.315 1.00 52.55 7 
4.288 1.00 53.16 6 
3.159 1.00 54.45 6 
2.006 1.00 57.19 6 
2.120 1.00 57.12 6 
1.069 1.00 60.62 6 
0.810 1.00 58.57 6 
-0.247 1.00 60.35 6 
-0.112 1.00 61.47 6 
-1.149 1.00 62.53 8 
3.735 1.00 53.38 6 
3.591 1.00 53.74 8 
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77.930 26.212 8.533 1.00 94.96 8 
75.657 28.917 11.116 1.00 84.76 6 
75.551 28.466 12.259 1.00 84.92 8 
76.097 30,146 10.857 1.00 82.51 7 
76.465 31.063 11.930 1.00 80.45 6 
77.208 32.289 11.382 1.00 83.04 6 
78.635 32.032 10.918 1.00 87.73 6 
79.370 33.358 10.688 1.00 91.97 6 
80.781 33.171 10.340 1.00 95.94 7 
81.667 34.164 10.199 1.00 97.73 6 
81.285 35.435 10.379 1.00 98.12 7 
82.938 33.892 9.880 1.00 97.12 7 
75.256 31.557 12.708 1.00 76.54 6 
74.265 31.999 12.122 1.00 76.16 8 
75.322 31.483 14.045 1.00 73.86 7 
76.343 30.778 14.834 1.00 73.05 6 
74.231 31.936 14.916 1.00 70.73 6 
74.647 31.425 16.295 1.00 70.99 6 
75.529 30.257 15.984 1.00 72.42 6 
74.199 33.466 14.891 1.00 67.63 6 
75.173 34.110 14.493 1.00 66.91 8 
73.076 34.046 15.288 1.00 63.36 7 
72.980 35.487 15.346 1.00 57.94 6 
71.537 35.949 15.266 1.00 55.79 6 
71.403 37.366 15.780 1.00 54.61 6 
71.082 35.870 13.839 1.00 57.04 6 
73.554 35.821 16.706 1.00 57.83 6 
73.180 35.207 17.711 1.00 58.05 8 
74.490 36.760 16.744 1.00 55.79 7 
75.087 37.130 18.014 1.00 55.59 6 
76.508 37.588 17.810 1.00 54.25 6 
74.270 38.224 18.677 1.00 54.76 6 
74.244 39.370 18.216 1.00 54.28 8 
73.596 37.850 19.759 1.00 53.10 7 
72.769 38.783 20.514 1.00 53.62 6 
71.338 38.222 20.767 1.00 52.63 6 
70.531 39.210 21.591 1.00 47.51' 6 
70.641 37.940 19.446 1.00 52.51 6 
73.412 39.051 21.865 1.00 53.21 6 
73.760 38.119 22.599 1.00 53.58 8 
73.583 40.325 22.184 1.00 51.51 7 
74.154 40.681 23.459 1.00 54.05 -6 
75.288 41.690 23.276 1.00 52.43 6 
74.821 42.854 22.632 1.00 55.74 8 
73.024 41.277 24.301 1.00 56.57 6 
72.236 42.110 23.825 1.00 54.93 8 
72.946 40.824 25.550 1.00 58.33 7 
71.934 41.280 26.493 1.00 60.16 6 
71.058 40.118 26.966 1.00 59.85 6 
69.842 40.662 27.700 1.00 60.72 6 
70.653 39.264 25.783 1.00 60.11 6 
72.599 41.891 27.724 1.00 61.00 6 
.73.542 41.327 28.279 1.00 62.05 8 
72.092 43.037 28.160 1.00 61.21 7 
72.648 43.717 29.318 1.00 60.47 6 
73.688 44.743 28.851 1.00 61.09 6 
74.162 45.553 29.919 1.00 63.13 8 
71.552 44.422 30.111 1.00 60.62 6 
70.941 45.375 29.621 1.00 62.55 8 
71.300 43.962 31.332 1.00 58.26 7 
70.282 44.591 32.165 1.00 56.62 6 
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67.827 
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1.00 88.94 
1.00 88.58 
1.00 65.48 
1.00 65.84 
1.00 64.00 
1.00 61.93 
1,00 62.28 
1.00 65.24 
1.00 61.73 
1.00 59.03 
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ATOM 7127 CG HIS E 69 

ATOM 7128 CD2 HIS E 69 

ATOM 7129 ND1 HIS E 69 

ATOM 7130 CE1 HIS E 69 

ATOM 7131 NE2 HIS E 69 

ATOM 7132 C HIS E 69 

ATOM 7133 0 HIS E 69 

ATOM 7134 N SER E 70 

ATOM 7135 CA SER E 70 

ATOM 7136 CB SER E 70 

ATOM 7137 OG SER E 70 

ATOM 7138 C SER E 70 

ATOM 7139 O SER E " 70 

ATOM 7140 N PRO E 71 

ATOM. 7141 CD PRO E 71 

ATOM 7142 CA PRO E 71 

ATOM 7143 CB PRO E 71 

ATOM 7144 CG PRO E 71 

ATOM 7145 C PRO E 71 

ATOM 7146 O PRO E 71 

ATOM 7147 N ASP E 72 

ATOM 7148 CA ASP E 72 

ATOM 7149 CB ASP E 72 

ATOM 7150 CG ASP E 72 

ATOM 7151 OD1 ASP E 72 

ATOM 7152 OD2 ASP E ■ 72 

ATOM 7153 C ASP E 72 

ATOM 7154 O ASP E 72 

ATOM 7155 N GLN E 73 

ATOM 7156 CA GLN E 73 

ATOM 7157 CB GLN E 73 

ATOM 7158 CG GLN E 73 

ATOM 7159 CD GLN E 73 

ATOM 7160 OEl GLN E 73 

ATOM 7161 NE2 GLN E 73 

ATOM 7162 C GLN E 73 

ATOM 7163 O GLN E 73 

ATOM 7164 N VALE 74 

ATOM 7165 CA VAL E 74 

ATOM 7166 CB VAL E 74 

ATOM 7167 CGI VAL E 74 

ATOM 7168 CG2 VAL E 74 

ATOM 7169 C VAL E 74 

ATOM 7170 O VAL E 74 

ATOM 7171 N SER E 75 

ATOM 7172 CA SER E 75 

ATOM 7173 CB SER E 75 

ATOM 7174 OG SER E 75 

ATOM 7175 C SER E 75 

ATOM 7176 O SER E 75 

ATOM 7177 N VAL E 76 

ATOM 7178 CA VAL E 76 

ATOM 7179 CB VAL E 76 

ATOM 7180 CGI VAL E 76 

ATOM 7181 CG2 VAL E 76 

ATOM 7182 C VAL E 7 6 

ATOM 7183 O. VAL E 76 

ATOM 7184 N PRO E 77 

ATOM 7185 CD PRO E 77 

ATOM 718 6 CA PRO E 77 

ATOM 7187 CB PRO E 77 
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75.366 62.083 0.283 1.00 77.46 6 
74 568 62.857 -0.491 1.00 76.01 6 
76 390 62.898 0.720 1.00 75.50 7 
76 212 64.114 0.239 1.00 74.91 6 
75.115 64.117 -0.497 1.00 77.04 7 
73.748 60.641 2.716 1.00 75.96 6 
72.954 61.408 2.170 1.00 75.96 8 
73.431 59.880 3.754 1.00 75.27 7 
72.086 59.867 4.308 1.00 72.77 6 
71.307 58.758 3.639 1.00 71.14 6 
72.085 57.578 3.703 1.00 68.45 8 
72.177 59.585 5.806 1.00 72.84 6 
73.254 59.212 6.304 1.00 73.59 8 
71.065 59.789 6.550 1.00 71.65 7 
69.793 60.434 6.160 1.00 70.63 6 
71.091 59.521 7.989 1.00 69.54 6 
69.627 59.649 8.376 1.00 69.50 6 
69.172 60.783 7.505 1.00 69.75 6 
71.611 58.104 8.128 1.00 67.45 6 
71.371 57.288 7.249 1.00 67.47 8 
72.332 57.813 9.202 1.00 65.90 7 
72.888 56.480 9.401 1.00 65.38 6 
74.336 56.595 9.864 1.00 66.99. 6 
74.623 57.933 10.513 1.00 69.68 6 
75.809 58.349 10.511 1.00 72.85 8 
73.659 58.564 11.022 1.00 69.14 8 
72.078 55.656 10.387 1.00 64.69 6 
72.273 54.438 10,492 1.00 62.72 8 
71.194 56.334 11.122 1.00 63.38 7 
70.291 55.692 12.073 1.00 62.59 6 
70.703 55.968 13.502 1.00 63.56 6 
71.812 55.147 14.064 1.00 66.36 6 
72.073 55,601 15.478 1.00 69.74 6 
72.311 56,794 15.711 1.00 71.52 8 
72.005 54.676 16.437 1.00 70.16 7 
68.850 56.189 11.932 1.00 60.94 6 
68.599 57.336 11.548 1.00 60.74 8 
67.910 55.318 12.281 1.00 57.54 7 
66.495 55.652 12.254 1.00 54.00 6 
65.857 55.296-10.901 1.00 52.88 6 
66.391 56.201 9.814 1.00 52.07 6 
66.151 53.846 10.564 1.00 53.11 6 
65.816 54.844 13.349 1.00 51.46 6 
66.355 53.838 13.804 1.00 51.44 8 
64.649 55.299 13.789 1.00 48.05 7 
63.893 54.592 14.812 1.00 45.32 6. 
63.222 55.583 15.761 1.00 44.86 6 
64.122 56.022 16.763 1.00 42.99 8 
62.846 53.727 14.123 1.00 45.51 6 
61.959 54.228 13.431 1.00 45.94 8 
62.953 52.420 14.325 1.00.44.89 7 
62.052 51.463 13.706 1.00 43.76 6 
62.857 50.400 12.943 1.00 44.31 6 
61.930 49.411 12.296 1.00 45.29 6 
63.734 51.060 11.907 1.00 45.23 6 
61.169 50.751 14.718 1.00 43.81 6 
61.641 50.331 15.772 1.00 46.08 8 
59.868 50.612 14.418 1.00 41.91 7 
59.085 51.238 13.344 1.00 41.05 6 
58.987 49.923 15.360 1.00 39.46 6 
57.619 50.076 14.719 1.00 39.68 6 
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63.513 40.766 14.999 1.00 38.10 6 

64.356 41.636 15.086 1.00 39.18 8 

63.068 40.104 16.049 1.00 39.36 7 

63.578 40.395 17.367 1.00 38.27 6 

62.562 41.277 18.141 1.00 39.12 6 

62.919 41.352 19.596 1.00 43.60 6 

62.557 42.678 17.565 1.00 38.09 6 

63.853 39.081 18.089 1.00 37.11 6 

63.154 38.098 17.896 1.00 39.80 8 

64.909 39.039 18.899 1.00 35.89 7 

65.921 40.088 19.075 1.00 38.94 6 

65.276 37.842 19.651 1.00 35.46 6 

66.485 38.306 20.456 1.00 36.52 6 

67.087 39:306 19.600 1.00 37.91 6 

64.134 37.398 20.555 1.00 36.04 6 

63.541 38.220 21.246 1.00 34.75 8 

63.839 36.103 20.565 1.00 33.87 7 

62.771 35.595 21.400 1.00 35.26 6 
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85 62.150 34.350 

85 63.150 33.259 

85 64.268 33.583 

85 62.828 32.079 

85 63.277 35.290 

85 63.139 34.174 

86 63.848 36.307 
86 64.387 -36.185 
86 65.211 37.414 
86 66.410 37.646 
86 67.131 38.893 
86 67.300 36.446 
86 63.317 36.021 

86 62.226 36.557 

87 63.652 35.293 
87 62.727 35.060 
87 61.950 33.766 
87 63.510 34.959 

87 64.583 34.372 

88 62.989 35.550 
88 63.639 35.460 
88 63.259 36.638 
88 63.154 34.168 

88 62.028 34.089 

89 64.008 33.152 
89 63.691 31.832 
89 64.970 31.010 
89 65.633 30.711 
89 66.903 30.120 
89 67.519 29.831 
89 64.995 31.006 
89 65,599 30.720 
89 66.860 30.131 
89 67.437 29.821 
89 62.959 31.792 

89 62.113 30.920 

90 63.275 32,723 
90 62.621 32.729 
90 63.658 32.682 
90 64.654 33.809 
90 65.197 34.116 
90 64.914 34.425 
90 61.668 33.894 

90 61.397 34.296 

91 61.170 34.437 
91 6*0.207 35.526 
91 59.974 36.095 
91 58.937 34.881 

91 58.543 33.800 

92 58.306 35.569 
92 57.111 35.100 
92 57.301 35.365 
92- -56.517 36.598 
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185 67.578 18.509 

186 67.460 17.831 
186 66.968 16.497 
186 66.593 15.755 
186 65.784 16.580 
186 65.770 16.594 

186 65.612 15.764 

187 64.948 17.624 
187 63.753 17.876 
187 . 64.119 18.104 
187 63.463 17.631 
187 63.043 19.179 

187 63.980 20.757 

188 65.204 18.841 
188 65.589 19.318 
188 67.013 18.991 
188 68.012 19.334 
188 65.319 20.833 

188 63.808 21.349 

189 67.108 18.340 
189 65.864 18.064 
189 68.292 17.878 
189 67.738 17.666 
189 66.345 17.173 
189 69.547 18.782 

189 70.592 18.398 

190 69.450 19.961 
190 70.592 20.878 
190 70.358 21.881 
190 69.520 21.352 
190 70.336 20.533 
190 71.471 20.965 
190 69.836 19.472 
190 70.822 21.663 

190 70.095 21.489 

191 71.826 22.543 
191 72.142 23.390 
191 73.651 23.536 
191 71.502 24.787 

191 71.379 25.340 

192 71.097 25.355 
192 70.487 26.678 
192 69.025 26.613 
192 68.096 25.953 
192 67.939 24.564 
192 67.146 23.934 
192 67.435 26.705 
192 66.642 26.094 
192 66.505 24.710 
192 65.744 24.101 
192 71.262 27.694 

192 71.181 27.699 

193 72.010 28.557 
193 72.792 29.606 
193 73.643 30.372 
193 74.722 29.522 
193 75.625 30.345 
193 76.083 31.435 
193 75.881 29.898 
193 71.890 30.600 
193 70.747 30.828 
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72.418 31.187 27.566 1.00 62.00 7 

71.683 32.168 28.340 1.00 60.31 6 

70.644 31.480 29.235 1.00 62.01 6 

71.268 30.749 30.413 1.00 64.58 6 

71.200 29.504 30.448 1.00 66.44 8 

71.824 31.415 31.314 1.00 64.95 8 

72.637 32.989 29.193 1.00 59.18 6 

73.715 32.515 29.570 1.00 59.24 8 

72.238 34.223 29.484 1.00 56.73 7 

73.029 35.121 30.311 1.00 55.51 6 
73.019 36.555 29.763 1.00 53^92 6 
73.686 37.498 30.752 1.00 53.41 6 
73.738 36.595 28.431 1.00 55.34 6 
72.453 35.145 31.715 1.00 56.03 6 
71.270 35.400 31.907 1.00 56.70 8 
73.292 34.868 32.702 1.00 57.06 7 
72.834 34.870 34.077 1.00 57.01 6 
73.402 33.673 34.821 1.00 57.41 6 
72.908 33.555 36.238 1.00 61.42 6 
73.533 32.383 36.968 1.00 62.69 6 
73.377 31.235 36.493 1.00 64.88 8 
74.184 32.604 38.013 1.00 64.76 8 
73.315 36.160 34.715 1.00 57.51 6 
74.518 36.427 34.755 1.00 60.16 8 
72.375 36.969 35.192 1.00 55.54 7 
72.712 38.226 35.829 1.00 54.09 6 
71.853 39.380 35.278 1.00 50.90 6 
72.241 40.685 35.932 1.00 47.64 6 

72.030 39.471 33.782 1.00 49.59 6 

72.473 38.067 37.325 1.00 56.97 6 
71.414 37.614 37.748 1.00 58.03 8 
73.476 38.409 38.125 1.00 58.05 7 
73.338 38.300 39.562 1.00 58.55 6 
74.550 37.584 40.174 1.00 58.65 6 
74.556 36.209 39.811 1.00 60.87 8 
73.182 39.685 40.155 1.00 58.53 6 
74.049 40.549 40.004 1.00 59.57 8 
72.060 39.895 40.823 1.00 58.51 7 
71.803 41.176 41.434 1.00 59.74 6 
70.361 41.612 41.186 1.00 59.28 6 
69.921 42.861 41.953 1.00 58.45 6 
70.758 44.059 41.543 1.00 57.64 6 
68.466 43.127 41.669 1.00 57.53 6 
72.039 41.084 42.917 1.00 61.63 6 
71.226 40.500 43.640 1.00 64.16 8 
73.163 41.637 43.366 1.00 61.69 7 
73.486 41.658 44.780 1.00 58.91 6 
74.981 41.477 44.994 1.00 61.22 6 
75.355 41.522 46.454 1.00 63.36 6 
74.686 40.916 47.295 1.00 64.82 8 
76.426 42.235 46.770 1.00 64.37 7 
73.048 43.026 45.280 1.00 56.95 6 
73.610 44.056 44.905 1.00 58.64 8 
72.018 43.030 46.109 1.00 54.18 7 

71.474 44.260 46.650 1.00 52.16 6 
70.257 44.688 45.844 1.00 50.83 6 
69.065 43.780 46.028 1.00 47.47 6 
67.923 44.233 46.681 1.00 47.04 6 
69.107 42.458 45.601 1.00 44.93 6 
66.843 43.382 46.911 1.00 45.49 6 
68.043 41.608 45.829 1.00 43.88 6 
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Table 1: Structural coordinates of AChBP 

"Atom type" refers to the element whoose coordinate are measured. The first letter in 
the column defines the element. 

"Residue" refers to the amino acid in the AChBP protein sequence, using the 
standard three letter abbreviations known in the art. 
"#" refers to the residue number. 

"X, Y, T crystallographically define the atomic position, in three-dimensional space, 
of the element measured. 
"OCC" is the occupancy volume. 

"B" is a thermal factor that measures movement of the atom around its atomic center. 
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Claims 



1 . A water-soluble protein derived from a mollusc being capable of binding a 
ligand of a ligand-gated receptor. 

5 

2. The protein of claim 1, wherein the ligand is acetylcholine, gamma-amino- 
butyric acid (GABA), glycine or serotonin. 

3. The protein of claim 2, wherein said protein is a acetylcholine-binding protein 
10 (AChBP). 

4. The protein of any one of claim 1 to 3 which is capable of forming multimers. 

5. The protein of any one of claims 1 to 4 which is derived form a Pulmonata 
1 5 species, preferably from a Basommatophora species. 

6. The protein of any one of claims 1 to 5 comprising an amino acid sequence 
selected from the group consisting of: 

(a) an amino acid sequence as depicted in any one of SEQ ID Nos. 2, 4, 
20 6 or 8 or a functional equivalent thereof, or a fragment of at least 5 

continuous amino acids thereof; 

(b) an amino acid sequence having at least 30% amino acid identity to the 
amino acid sequence of any one of SEQ ID Nos. 2, 4, 6 or 8. 

25 7. A water-soluble ligand binding protein capable of binding a ligand of a ligand- 
gated receptor and comprising at least 5 continuous amino acids of the 
aminoacid sequence depicted in any one of SEQ ID Nos. 2, 4, 6 or 8 and/or 
said protein is detectable by a monoclonal or polyclonal antibody which 
recognises, preferably with a binding affinity of at least 10* 7 M, a protein of any 

30 one of claims 1 to 6. 



8. A water-soluble protein being capable of binding a ligand of a ligand-gated 
receptor comprising 

(a) at least the amino acids of the water-soluble protein of any one of 
35 claims 1 to 6 determining solubility of said protein, in the same or 

corresponding positions as in said protein; and 

(b) at least 4 amino acids determining binding to said ligand. 
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9. The protein of claim 7 or 8 which is capable of forming multimers. 

1 0. The protein of any one of claims 7 to 9 comprising 200-240 amino acids. 

11. The protein of any one of claims 7 to 1 0, wherein the ligand is acetylcholine, 
nicotine, lophotoxin, d-tubocurarine, carbamylcholine, galanthamine or 
epibatidine. 

10 12. The protein of any one of claims 1 to 1 1 , wherein said ligand-gated receptor 
is derived from an arthropod (preferably insect), a plant (preferably a higher 
plant, most preferably a seed plant) or a chordate (preferably a mammalian, 
most preferably human). 

15 13. The protein of any one of claims 7 to 1 2, wherein said ligand-gated receptor is 
a nicotinic acetylcholine receptor. 

14. The protein of any one of claims 7 to 13, wherein said amino acids 
determining solubility are in the same positions as in the AChBP having the 

20 amino acid sequence as depicted in any one of SEQ ID Nos. 2, 4, 6 or 8; 

preferably in which said solubility-determining amino adds comprise solvent 
accessible regions in the crystal structure according to Figure 10. 

1 5. The protein of any one of claims 7 to 14 comprising an amino acid sequence 
25 having at least 40% amino acid identity to the amino acid sequence 20-223 of 

any one of SEQ ID Nos. 2, 4, 6 or 8, in which the ligand binding amino acids 
have been replaced with the corresponding amino acids of a ligand-gated 
receptor. 

30 16. The protein of any one of claims 7 to 15, in which said solubility-determining 
amino acids (a) comprise hydrophilic amino acids (Asp, Glu, Arg, Lys) from 
the sequences 20-44, 73-81,. 86-92, 112-120, 135-152, 166-189, 196-20, 
209-21 3, and/or 21 9-227 of SEQ ID No. 2. 



35 17. 



The protein of claim 16, in which said solubility determining amino acids (a) 
comprise amino acids Asp(36), Asp(68), Glu(115), Arg(137), Asp(143), 
Asp(148), Glu(150), Arg(167), Arg(189), Glu(215) of SEQ ID No.2, wherein 
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Asp may be exchanged for Glu and vice versa and Lys may be exchanged for 
Arg and vice versa. 

18. The protein of any one of claims 7 to 17 which further comprises the amino 
5 acids Cys(142), Thr(149), Ala(153), Thr(154), Cys(155), Arg(156), lle(157) 

and/or Lys(158) of SEQ ID No. 2. 

19. The protein of any one of claims 7 to 17 which comprises the amino acids (b) 
Pro(39), Trp(77), Trp(101), Pro(103), Asp(194), and/or Ser(161) of SEQ ID 

10 No. 2. 

20. The protein of any one of claims 7 to 19 in which the amino acid sequences 
165-169 and/or 200-203 of SEQ ID No. 2 have been exchanged with the 
corresponding sequence of the ligand-gated receptor. 

15 

21 . The protein of any one of claims 7 to 20 which is capable of binding a ligand 
of an acetylcholine receptor, in which at least one of the amino acid 
sequences Trp(101) - Tyr(T08), Trp(162) - His(164) and Tyr(204) - Tyr(211) 
of SEQ ID No. 2 have been exchanged with the corresponding sequence of 

20 the acetylcholine receptor. 

22. A method for the production of a water-soluble ligand-gated receptor or a 
corresponding ligand-binding domain or for improving the water solubility and 
accessibility to crystallization of such a receptor or domain, said method 

25 comprising altering the amino acid sequence of the extracellular domain of a 

ligand-gated receptor by way of substituting, adding, deleting or modifying at 
least one amino acid at a position corresponding to an amino acid 
determining or contributing to the water-solubility of the protein of any one of 
claims 1 to 21 . 

30 

23. The method of claim 22, wherein the ligand-gated receptor is defined as in 
any one of claims 1 to 21 . 

24. The method of claim 22 or 23, wherein at least one amino acid is altered to 
35 the corresponding amino acid of the amino acid sequence depicted in any 

one of SEQ ID Nos. 2, 4, 6 or 8, or to a an equivalent amino acid, preferably 
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in which said solubility-determining amino acids comprise solvent accessible 
regions in the crystal structure according to Figure 10. 

25. The method of any one of claims 22 to 24, wherein loop Cys123-Cys136 of 
5 SEQ ID No. 2 is inserted into the corresponding region of the ligand binding 

domain of the ligand-gated receptor. 

26. The method of any one of claims 22 to 25 further comprising 

(a) culturing a host cell transfected with and capable of expressing a 

1 0 polynucleotide comprising a nucleotide sequence encoding the altered 

amino acid sequence; and optionally 

(b) recovering said water-soluble ligand-gated receptor or corresponding 
ligand-binding domain from the culture. 

15 27. A water-soluble ligand-gated receptor or ligand-binding domain obtainable by 
the method of any one of claims 22 to 26. 

28. The protein of any one of claims 1 to 21 or 27 further comprising a spacer 
sequence allowing coupling with a carrier body. 

20 

29. A fusion protein comprising the water-soluble ligand-binding protein of any 
one of claims 1 to 21 , 27 or 28, or a binding fragment thereof and a fragment 
of a ligand-gated receptor 

25 30. A dimer or pentamer consisting of at least one monomer comprising a protein 
of any one of claims 1 to 21 or 27 to 29. 

31 . A ligand-gated ion channel comprising a protein of any one of claims 1 to 21 
or 27 to 29 or the dimer or pentamer of claim 30. 

30 

32. One or more polynucleotides encoding the protein of any one of claims 1 to 
21 or 27 to 29, the dimer or pentamer of claim 30 or the ligand-gated ion 
channel of claim 31 . 

35 33. The polynucleotide(s) of claim 32 which comprise(s) 



WO 01/58951 



Page 214 of 259 



WO 01/58951 PCT/EP01/01457 

-213 

(a) a nucleotide sequence having at least 1 5 continuous nucleotides of 
the nucleotide sequence depicted in any one of SEQ ID Nos. 1 t 3, 5 or 
7 or a degenerated sequence thereof; or 

(b) a nucleotide sequence capable of hybridizing to a nucleotide 
5 sequence of (a) under stringent hybridisation conditions. 

34. The polynucleotide(s) of claim 32 or 33 which is(are) operatively linked to 
heterologous expression control sequences allowing expression inprokaryotic 
or eukaryotic cells. . 

10 

35. One or more vector(s) containing the polynucleotide(s) of any one of claims 
32 to 34. 

36. A host cell genetically engineered with the polynucleotide(s) of any one claims 
15 32 to 34 or with the vector(s) of claim 35. 

37. An antigen comprising an epitope of at least 5 continuous amino acids of the 
amino acid sequence depicted in any one of SEQ ID Nos. 2, 4, 6 or 8 and/or 
said epitope is detectable by a monoclonal or polyclonal antibody which 

20 recognises, preferably with a binding affinity of at least 10' 7 M, a protein of any 

one of claims 1 to 6. 

38. An antibody specifically recognizing the protein of any one of claims 1 to 21 or 
27 to 29, the dimer or pentamer of claim 30, the ligand-gated ion channel of 

25 claim 31 or the antigen of claim 37. 

39. An oligonucleotide probe comprising a nucleotide sequence having at least 1 5 
continuous nucleotides of a polynucleotide of any one claims 32 to 34 or 
encoding the antigen of claim 37. 

30 

40. A composition comprising the protein of any one of claims 1 to 21 or 27 to 29, 
the dimer or pentamer of claim -30, the ligand-gated ion channel of claim 31, 
the polynucleotide(s) of any one claims 32 to 34, the vector(s) of claim 35, the 
host cell of claim 36, the antigen of claim 37, the antibody of claim 38, or an 

35 oligonucleotide probe of claim 39; and optionally suitable means for detection 

or performing a ligand-receptor binding assay. 
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41 . A method for identifying an agonist/activator or antagonist/inhibitor of a ligand- 
gated receptor comprising the steps of: 

(a) contacting the water-soluble ligand-binding protein of any one of 
claims 1 to 21 or 27 to 29, the dimer or pentamer of claim 30, the 
5 ligand-gated ion channel of claim 31 or a cell expressing said 

protein in the presence of components capable of providing a 
detectable signal in response to ligand binding with a compound to be 
screened under conditions that permit binding of said compound 
to the ligand-binding protein; and 
1 0 (b) detecting the presence or absence of a signal generated from the 

binding activity of the ligand-binding protein, wherein the 
presence/increase and absence/decrease of the signal is indicative for 
an agonist/activator and antagonist/inhibitor, respectively, of a ligand- 
gated receptor. 

15 

42. A crystal of a protein of any one of claims 1 to 21 or 27 to 29, the dimer or 
pentamer of claim 30 or the ligand-gated ion channel of claim 31 . 

43. A crystal of a protein-ligand complex comprising a protein of any one of 
20 claims 1 to 21 or 27 to 29, the dimer or pentamer of claim 30 or the ligand- 
gated ion channel of claim 31; and a ligand. 

44. The crystal of claim 43, wherein the ligand comprises an N-alkylated 
hydroxyalkyl and/or a quaternary ammonium ion. 



25 



30 



35 



45. The crystal of claim 43, wherein the ligand comprises 4-(2-HydroxyethyI)-1- 
piperazineethanesulfonic acid (HEPES), B-bippinatin, lophotoxin, d- 
tubocurarine, carbamylcholine, galanthamine, epibatidine or alpha- 
bungarotoxin. 

46. The crystal of any one of claims 42 to 45, wherein the crystal effectively 
diffracts X-rays for the determination of the atomic coordinates of the protein 
or protein-ligand complex to a resolution of greater than 5.0, preferably 
greater than 4.0 Angstroms. 

47. The crystal of any one claims 42 to 46, wherein the protein has an amino acid 
sequence of amino acids 20 to 223 of SEQ ID No. 2, or an amino acid 
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sequence that differs from amino acid 20 to 223 of SEQ ID No. 2 by only 
having conservative substitutions. 

48. The crystal of claim 47, wherein the ligand is HEPES. 

5 

49. The crystal of claim 46 having (1) a space group of P2\1fa and a unit cell of 
dimensions of a=120.6A, b=137.0A and c=161.5A; (2) a space group of 
P4 2 2 1 2 and a unit cell of dimensions of a=b=141.6A and c=120.8A or (3) a 
space group of P2i and a unit cell of dimensions of a=121.lA, b=162.lA, 

10 c=139,4A, 3=90.1°. 

50. The crystal of any one of claims 42 to 49, wherein the protein has secondary 
structural elements that include .alpha.-helix and antiparallel .beta.-sheets as 
shown in Figure 7, 10, 11 and/or 12, 

15 

51 . The crystal of any one claims 42 to. 50 having a three-dimensional structure 
as defined by atomic coordinates shown in Table 1. 

52. The crystal of any one of claims 42 to 51 having a binding cavity as shown in 
20 Figure 6, 8, 9 and/or 13. 

53. A method of using the crystal of any one of claims 42 to 52 in a drug 
screening assay comprising: 

(a) selecting a potential ligand by performing structure assisted drug 

25 design with the three-dimensional structure determined for the. crystal, 

wherein said selecting is performed in conjunction with computer 
modeling; optionally 

(b) contacting the potential ligand with the ligand binding domain of 
the ligand-gated receptor in an in vitro or in vivo assay; and 

30 (c) detecting the binding of the potential ligand for the ligand binding 

domain. 

54. The method of claim 53, wherein the ligand-gated receptor is a nicotinic 
acetylcholine receptor. 



35 



55. The method of claim 53 or 54 further comprising: 
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(d) forming a supplemental crystal of a protein-ligand complex by co- 
crystallization or soaking the crystal of the water-soluble ligand-binding 
protein of any one of claims 1 to 21 or 27 to 29, the dimer or pentamer 
of claim 30 or the ligand-gated ion channel of claim 31 , with a potential 

5 drug, wherein the crystal effectively diffracts X-rays for the 

determination of the atomic coordinates of the protein-ligand complex 
to a resolution of greater than 5.0, preferably greater than 4.0 
Angstroms, more preferably greater than 3; 

(e) determining the three-dimensional structure of the supplemental 
10 crystal; 

(f) selecting a candidate drug by performing a structure assisted drug 
design with the three-dimensional structure determined for the 
supplemental crystal, wherein said selecting is performed in 
conjunction with computer modeling; optionally 

1 5 (g) contacting the candidate drug with a cell that expresses the ligand- 

gated receptor; and 
(h) detecting a cell response; wherein a candidate drug is identified as 
a drug when the cell response is altered compared to a cell that 
has not been contacted with the candidate compound. 

20 

56. The method of any one of claims 53 to 55 further comprising an initial step 
that precedes step (a) wherein said initial step consists of determining the 
three-dimensional structure of a crystal comprising a protein-ligand complex 
formed between the water-soluble ligand-binding protein of any one of claims 

25 1 to 21 or 27 to 29, the dimer or pentamer of claim 30 or the ligand-gated ion 

channel of claim 31 , and the ligand of the ligand-gated receptor, wherein the 
crystal effectively diffracts X-rays for the determination of the atomic 
coordinates of the protein-ligand complex to a resolution of greater .than 5.0, 
preferably greater than 4.0 Angstroms. 

30 

57. A method of growing a crystal of a protein-ligand complex comprising: 
(a) contacting the water-soluble ligand-binding protein of any one of 

claims 1 to 21 or 27 to 29, the dimer or pentamer of claim 30 or the 
ligand-gated ion channel of claim 31 with a ligand of a ligand- 
35 gated receptor, wherein the water-soluble ligand-binding protein 

forms a protein-ligand complex with the ligand; and 
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(b) growing the crystal of the protein-ligand complex; wherein the 
crystal effectively diffracts X-rays for the determination of the 
atomic coordinates of the protein-ligand complex to a resolution 
of greater than 5.0, preferably greater than 4.0 Angstroms. 

5 

58. A drug screening assay comprising soaking the crystal of any one of claims 
42 to 52 in a solution of compounds to be screened and detecting the binding 
of the compound to the ligand-binding protein. 

10 59. The method of claim 57 or 58, wherein said ligand comprises an alkylated 
nitrogen and/or quaternary ammonium ion. 

60. A method of increasing or decreasing the affinity of a drug to a ligand-gated 
receptor, comprising 

1 5 (a) performing structure assisted drug design with the three-dimensional 

structure determined for the crystal of any one of claims 42 to 52, 
wherein said drug design is performed in conjunction with computer 
modeling; and 

(b) modifying said drug to alter or eliminate a portion thereof 

20 suspected of interacting with a binding site of the binding cavity or 

with a non-specific binding site of the protein in the crystal. 

61 . The method of claim 60, wherein step (a) further comprises the steps of a 
method of any one of claims 53 to 59. 

25 

62. The method of claim 60 or 61 , further comprising after step (b), the additional 
step of : 

(c) repeating the method used to perform structure assisted drug design 
according to step (a) using the modified drug according to step (b). 

30 

63. A method of drug design comprising the step of using the structural 
coordinates of a water-soluble ligand-binding protein crystal comprising the 
coordinates of Table 1, to computationally evaluate a chemical entity for 
associating with the ligand-binding site or a non-specific binding site of a 

35 ligand-binding protein. 
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64. The method of any one of claims 53 to 63, wherein the identified drug 
prevents or promotes correct assembly of a ligand-gated ion channel. 

65. The method of any one of claims 53 to 63, wherein the identified drug binds to 
5 a non-specific binding site of a ligand-gated ion channel. 

66. The method of any one of claims 53 to 65 further comprising synthesizing the 
drug in a therapeutically effective amount. 

10 67. A drug produced by the method of claim 66 or a pro-drug thereof. 

68. The drug of claim 67 which interacts with a ligand-gated receptor comprising 
a pentamer of claim 30 with monomers A to E, wherein the drug binds to one 
or more primary contact regions of a monomer (residues from A contacting B) 

1 5 defined by amino acid residues 1 5 to 21 , 44 to 47, 85 to 87, 91 to 94, 1 22 to 

124, 143 to 146, 149, 185 to 187 of the mature protein of SEQ ID No. 2 and/or 
to one or more of the complementary contact regions of the other monomer 
(from B contacting A, (identical to residues on A contacting E) defined by 
amino acid residues 3 to 4, 7 to 8, 1 1 , 37 to 39, 53, 75 to 77, 96 to 104, 1 14 to 

20 118 and 163-170 of the mature protein of SEQ ID No. 2; or to the contact 

regions as identified in Figure 14; or to the corresponding contact regions of 
the monomers of a ligand-gated ion channel. 

69. The drug of claim 68, wherein the ligand-gated ion channel is the nicotinic 
25 acetylcholine receptor and the order of the monomers is ayapS. 

70. A computer readable medium comprising a nucleotide sequence of the 
polynucleotide(s) of any one of claims 32 to 34, an amino acid sequence of a 
protein of any one of claims 1 to 21 or 27 to 29, the dimer or pentamer of 

30 claim 30 or the ligand-gated ion channel of claim 31, or the structural 

coordinates of a crystal of any one of claims 40 to 50. 

71 . A device comprising the computer readable medium of claim 70. 

35 72. Use of the computer readable medium of claim 70 or the device of claim 71 
for modeling an antagonist/inhibitor or agonist/activator of a ligand-gated 
receptor. 
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73. Use of the crystal of any one of claims 42 to 52 or its structural coordinates as 
a template for modeling the 3D structure of a ligand-gated ion channel. 

5 74. Use of the polynucleotide(s) of any one of claims 32 to 34, the protein of any 
one of claims 1 to 21 or 27 to 29, the dimer or pentamer of claim 30, the 
ligand-gated ion channel of claim 31, the vector(s) of claim 35, the host cell of 
claim 36, the antigen of claim 37, the antibody of claim 38, an oligonucleotide 
probe of claim 39, the crystal of any one of claims 42 to 52 or a method of any 
10 one of claims 53 to 66 for screening or profiling putative ligands of ligand- 

gated receptors. 

75. Use of an antagonist/inhibitor or agonist/activator identified according to a 
method of any one of claims 53 to 66 for the preparation of a pharmaceutical 

15 composition for the treatment of a ligand-gated ion channel mediated or 

related disorder. 

76. The use of claim 75, wherein the antagonist/inhibitor is or is derived from the 
protein of any one of claims 1 to 21 or 27 to 29, an antigen of claim 37, an 

20 antibody of claim 38 or from a toxin of the ligand-gated ion channel. 

77. The use of claim 75, wherein the agonist/activator is or is derived from a the 
protein of any one of claims 1 to 21 or 27 to 29, an antigen of claim 37, an 
antibody of claim 38 or from epibatidine, acetylcholine, choline, nicotine, 

25 carbachol, serotonin or GABA. 

78. The use of any one claims 75 to 77, wherein the ligand-gated ion channel is 
the nicotinic acetylcholine receptor and said mediated or related disorder is 
Tourette's syndrome, Alzheimer's disease, addiction to nicotine or 

30 schizophrenia. 

79. Use of ligand of a ligand-gated receptor for identifying and isolating a water- 
soluble ligand-binding protein from a mollusc. 



35 80. 



The use of claim 79, wherein said ligand is a-bungarotoxin. 
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L-AChBP_Tl. — MRRNIFCLACLWIVQACLSLDRADILYNIRQTSRPDVIPTQRDRPVAVSVSLKFINIL 

L-AChBPJT2. — MRRN I FCLACLWI VQGCLSLDRADILYNI RQTSRPDVI PTQRDRPVAVSVSLKFIN I L 

B-AChBPJTl. MAELRRIILLLCTIAFHVSHGQIRWTLLNQITGES — DVIPLSNNTPLNVSLNFKLMNIV 

B-AChBPJT2. MAELRGIILLLCTIAFHVSHGQIRWTLLNQITGES — DVIPLSNNTPLNVSLNFKLMNIL 

* * : * * .: . * ;* ;* * **** t . * . ++••+••*+. 

L-AChBP_Tl. E VNE I TNE VD W FWQQTTWSDRTLA- WNS S HS PDQVS VPI S S LWVPDLAAYNA I SKPE VL 

L-AChBPJT2. EVN E I TNEVD W FWQQTTWS DRTLA- WNS SHS PDQVS VP I SS LWVPDLAAYNA I SKPE VL 

B-AChBP_Tl. EADTEKDQVEWLWTQASWKVPYYSSLLSSSSLDQVSLPVSKMWTPDLSFYNAIAAPELL 
B-AChBP J2. EADTEKDQVEWLWTQASWKVPYYSSLLSSSSLDQVSLPASKMWTPDLSFYNAIAAPELL 
*.: .::*:**:* *+ * **★*.* * .* *♦*. + *★★. 



L-AChBP_Tl. 
L-AChBP_T2 . 
B-AChBPJTl. 
B-AChBP T2. 



L-AChBP_Tl. 
L-AChBP_T2. 
B-AChBPJTl. 
B-AChBP T2. 



TPQLARWSDGEVLYMPSIRQRFSCDVSGVDTESGATCRIKIGSWTHHSREISVDPTTE- 
TPQLARWSDGEVLYMPSIRQRFSCDVSGVDTESGATCRIKIGSWTHHSGEISVDPTTE- 
SADRVWSKDGSVIYVPSQRVRFTCDLINVDTEPGATCRIKVGSWTHDNKQFALITGEEG 
STDRVWSKDGSVIYVPSQRVRFTCDLINVDTEPGATCRIKVGSWTFDNKQLALITGEEG 
: . : . * .**.*:*:** * * * : * + : + * 

NSDDSEYFSQYSRFEILDVTQKKNSVTYSCCPEAYEDVEVSLNFRKKGRSEIL 
NSDDSEYFSQYSRFEILDVTQKKNSVIYSCCPEAYEDVEVSLNFRKKGRSEIL 

WNIAEYFDS-PKFDLLSATQSLNRKKYSCCENMYDDIEITFAFRKK 

VVNIAEYFDS-PKYDLLSATQSLNRKKYRCCENMYEDIEITFAFRKK 

: :***,. .::::*..**. * * * * . . +.★.*... +*** 



Figure 1 
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Average Hydrophobic^ =* «,1S9 



2D 




Average HydrophoMcity = .,189 



Figure 2 (continued) 
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L-AChBPJTl LDRADI LYN I RQTSRPDVI PT 

L-AChBPJT2 LDRADILYNIRQTSRPDVIPT 

B-AChBP_Tl QIRWTLLNQITGES — DVIPL 

B-AChBP__T2 QIRWTLLNQITGES— DVIPL 

h_nAChR_A7 MRC S PGG VWLALAAS LLH V SLQGEFQRKLYKELVKNYNPLERPV 

h_5HT3 MLLWVQQALLALLLPTLLAQGEARRSRNTTRPALLRLSDYLLTNYRKGVRPV 

h_GlyR_al -MYSFNTLRLYLSGAIVFFSLAASKEAEAARSATKPMSPSDFLDKLMGRTSGYDARIRPN 

h_GABAA_bl MRKSPGLSDCLWAWILLLSTLTGRSYGQPSLQDELKDNTTVFTRILDRLLDGYDNRLRPG 

L-AChBPJTl ■ QRDR-PVAVSVSLKFINILEVNEITNEVDWFWQQTTWSDRTLA — WNSSHSPDQVSVPI 

L-AChBPJT2 QRDR-PVAVSVSLKFINILEVNEITNEVDWFWQQTTWSDRTLA — WNSSHSPDQVSVPI 

B-AChBP__Tl SNNT-PLNVSLNFKLMNIVEADTEKDQVEWLWTQASWKVPYYSS-LLSSSSLDQVSLPV 

B-AChBPJT2 SNNT-PLNVSLNFKLMNILEADTEKDQVEWLWTQASWKVPYYSS-LLSSSSLDQVSLPA 

h_nAChR__A7 ANDSQPLTVYFSLSLLQIMDVDEKNQVLTTNIWLQMSWTDHYLQWNVSEYPGVKTVRFPD 

h_5HT3 RDWRKPTTVSIDVIVYAILNVDEKNQVLTTYIWYRQYWTDEFLQWNPEDFDNITKLSIPT 

h - G1 y R -. al FKG P- PVNVSCNI FINS FGS IAETTMDYRVNI FLRQQWNDPRLAYNEYPDDSLDLDPSML 

h_GABAA_bl LGER-VTEVKTDIFVTSFGPVSDHDMEYTIDVFFRQSWKDERLKF-KGPMTVLRLNNLMA 

* - - : .: : *. 

L-AChBPJTl SSLWVPDLAAYNAISK-PEVLTPQ — LARWSDGEVLYMPSIRQRFSCDVSG-VDTESGA 

L-AChBP_T2 SSLWVPDLAAYNAISK-PEVLTPQ— LARWSDGEVLYMPSIRQRFSCDVSG-VDTESGA 

B-AChBPJTl SKMWTFDLSFYNAIAA- PELLS AD — RVWSKDGSVIYVPSQRVRFTCDLIN-VDTEPGA 

B-AChBPJT2 SFCMWTPDLS FYNAIAA-PELLSTD — RVVVSKDGSVIYVPSQRVRFTCDLIN-VDTEPGA 

h_nAChR_A7 GQIWKPDILLYNSADERFDATFHT — NVLVNSSGHCQYLPPGIFKSSCYIDVRWFPFDVQ 

h_5HT3 DSIWVPDILINEFVDV-GKSPNIP — YVYIRHQGEVQNYKPLQVVTACSLDIYNFPFDVQ 

h_GlyR_al DSIWKPDLFFANEKGAHFHEITTDNKLLRISRNGNVLYSIRITLTLACPMDLKNFPMDVQ 

h_GABAAJbl SKIRTPDTFFHNGKKSVAHNMTMPNKLLRITEDGTLLYTMRLTVRAECPMHLEDFPMDAH 



L-AChBP_Tl TCRIKIGSWTHHSREISVDPT— -TE-NSDDSEYFSQYSRFEILDVTQKKNSVTYSCCPE 

L-AChBPjT2 TCRIKIGSWTHHSGEISVDPT TE-NSDDSEYFSQYSRFEILDVTQKKNSVIYSCCPE 

B-AChBPJTl TCRI KVGSWTHDNKQFAL ITG EEGWNI AE YFDS - PKFDLLSATQSLNRKKYSCCEN 

B-AChBPJT2 TCRIKVGSWT FDNKQLALITG EEGWNIAEYFDS-PKYDLLSATQSLNRKKYRCCEN 

h_nAChR_A7 HCKLKFGSWSYGGWSLDLQMQ EA DI SGYI PN-GEWDLVG I PGKRSERFYECCKE 

h_5HT3 NCSLTFTSWLHTIQDINISLWRLPEKVKSDRSVFMNQ-GEWELLGVLPYFREFSMESS-N 

h_GlyR_al TC I MQLES FGYTMNDLI FEWQ EQGAVQVADGLTL- PQFI LKEEKDLRYCTKH YNT -G 

h_GABAA_bl ACPLKFGSYAYTRAEVVYEWTR-EPARSVWAEDGSRLNQYDLLG — QTVDSGIVQSSTG 



L-AChBPJTl AY E DVE VSLN FRKKGRSE I L 

L-AChBPJT2 AYEDVEVSLNFRKKGRSEIL 

B-AChBP_Tl MYDDIEITFAFRKK 

B-AChBP. JT2 MYEDI E I T FAFRKK 

h_nAChR_A7 PYPDVTFTVTMRRRTLYYGL 

h_5HT3 YYAEMKFYWIRRRPLFYVV 

h_GlyR_al KFTCI EARFHLE RQMG Y YL I 

h_GABAA_bl EYVVMTTHFHLKRKIGYFVI 



Figure 3 
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nAChRJi-A2 MGPSCPVFLSFTKLSLWWLLLTPAGGEEAKRPPPRAPGDP LSSPSPTALPQGGSHTE 

nAChR_h-A4 MELGGPGAPRLLP P LLLLLGTGLLRASSHVE 

nAChR_h-A5 MAARGSGPRALRLLLLVQLVAGRCGLAGAAGGAQRGLSEPS 

nAChR_h-A3 MALAALAAPGAVAPRLLL LLLSLLP- - VAR AS E 

nAChR_h-A6 MLTSKGQGFLHGGLCL WLCVFTP— FFKGCVGCA 

nAChR_h-Al MEPWPLLL LFSLCSAGLVLG SE 

n AChR_h -A7 MRCS PGG VWLALAASL LHVSLQG-E FQR 

nAChR_h-A9 — CISFCWIYFAASRLR AAETADG-KYAQ 

B-AChBP_Tl MAELRRI I LLLCT IAFHVSHG-QIR 

B-AChBPJT2 MAELRG 1 1 LLLCT IAFHVSHG-QIR 

L-AChBP_Tl MRRNI FCLACL WIVQACLS-LDR 

L-AChBP_T2 MRRNI FCLACL WIVQGCLS-LDR 



nAChR_h-A2 T EDRLFKHLFRGYNRWARPVPNTSDVVIVRFGLSIAQLIDVDEKNQ^TTNVWLKQ 

nAChR_h-A4 TRAHAEERLLKKLFSGYNKWSRPVANISDWLVRFGLSIAQLIDVDEKNQMMTTNVWVKQ 

nAChRJl-A5 SIAKHEDSLLKDLFQDYERWVRPVEHLNDKIKIKFGLAISQLVDVDEKNQLMTTNVWLKQ 

nAChRJl-A3 A EHRLFERLFEDYNEIXRPVANVSDPVIIHFEVSMSQLVKVDEVNQIMETNLWLKQ 

nAChR_h-A6 T EERLFHKLFSHYNQFIRPVENVSDPVTVHFEVAITQLANVDEVNQIMETNLWLRH 

nAChRJi-Al H ETRLVAKLFKDYSSVVRPVEDHRQWEVTVGLQLIQLINVDEVNQIVTTNVRLKQ 

nAChR_h-A7 KLYKELVKNYNPLERPVANDSQPLTVYFSLSLLQIMDVDEKNQVLTTNIWLQM 

nAChR_h-A9 KLFNDLFEDYSNALRPVEDTDKVLNVTLQITLSQIKDMDERNQILTAYLWIRQ 

B-AChBP_Tl WTLLNQITGES— DVIPLSN-NTPLNVSLNFKLMNIVEADTEKDQVEVVLWTQA 

B-AChBPJT2 WTLLNQITGES — DVIPLSN-NTPLNVSLNFKLMNILEADTEKDQVEVVLWTQA 

L-AChBP_Tl ADILYNIRQTSRPDVIPTQR-DRPVAVSVSLKFINILEVNEITNEVDVVFWQQT 

L-AChBPJT2 ADILYNIRQTSRPDVIPTQR-DRPVAVSVSLKFINILEVNEITNEVDWFWQQT 

nAChR_h-A2 EWSDYKLRWNPTDFGNITSLRVPSEMIWIPDIVLYNNADGEFAVTHMTKAHLFSTGTVHW 

nAChR_h-A4 EWHDYKLRWDPADYENVTSIRIPSELIWRPDIVLYNNADGDFAVTHLTKAHLFHDGRVQW 

nAChR_h-A5 EWI DVKLRWNPDDYGGIKVIRVPSDSVWTPDIVLFDNADGRFEGTS-TKTVIRYNGTVTW 

nAChR_h-A3 IWNDYKLKWNPSDYGGAEFMRVPAQKIWKPDIVLYNNAVGDFQVDDKTKALLKYTGEVTW 

nAChR_h-A6 IWN DYKLRWDPMEYDGIETLRVPADKIWKPDIVLYNNAVGDFQVEGKTKALLKYNGMITW 

nAChR_h-Al QWVDYNLKWNPDDYGGVKKIHIPSEKIWRPDLVLYNNADGDFAIVKFTKVLLQYTGHITW 

nAChR_h-A7 SWTDHYLQWNVSEYPGVKTVRFPDGQIWKPDILLYNSADERFDATFHTNVLVNSSGHCQY 

nAChR_h-A9 IWHDAYLTWDRDQYDGLDSIRIPSDLVWRPDIVLYNKADDESSEPVNTNWLRYDGLITW 

B-AChBP_Tl SWKVPYYSS-LLSSSSLDQVSLPVSKMWTPDLSFYN-AIAAPELLSADRVWSKDGSVIY 

B-AChBP_T2 SWKVPYYS5-LLSSSSLDQVSLPASKMWTPDLSFYN-AIAAPELLSTDRWVSKDGSVIY 

L-AChBP_Tl TWS DRTLA- -WNSSHS PDQVS VPI SSLWVPDLAA YN-AISKPEVLT PQLARWSDGEVLY 

L-AChBPJT2 TWSDRTLA — WNSSHSPDQVSVPISSLWVPDLAAYN-AISKPEVLT PQLARWSDGEVLY 

nAChR_h-A2 VPPAIYKSSCSIDVTFFPFDQQNCKMKFGSWTYDKAKIDLEQMEQ-TVDLKDYWES-GEW 

nAChR_h-A4 TPPAIYKSSCSIDVTFFPFDQQNCTMKFGSWTYDKAKIDLVNMHS-RVDQLDFWES-GEW 

nAChR_h-A5 TPPANYKSSCTIDVTFFPFDLQNCSMKFGSWTYDGSQVDIILEDQ-DVDKRDFFDN-GEW 

nAChR_h-A3 IPPAIFKSSCKIDVTYFPFDYQNCTMKFGSWSYDKAKIDLVLIGS-SMNLKDYWES-GEW 

nAChR_h-A6 TPPAIFKSSCPMDITFFPFDHQNCSLKFGSWTYDKAEIDLLIIGS-KVDMNDFWEN-SEW 

nAChR_h-Al TPPAIFKSYCEIIVTHFPFDEQNCSMKLGTWTYDGSVVAINPESD-QPDLSNFMES-GEW 

nAChR_h-A7 LPPGIFKSSCYI DVRWFPFDVQHCKLKFGSWSYGGWSLDLQM — Q-EADISGYIPN-GEW 

nAChR_h-A9 DAPAITKSSCVVDVTYFPFDNQQCNLTFGSWTYNGNQVDIFNALD-SGDLSDFIED-VEW 

B-AChBPJTl VPSQRVRFTCDLINVDTEPG-ATCRIKVGSWTHDNKQFALITGEEGVVNIAEYFDS-PKF 

B-AChBPJT2 VPSQRVRFTCDLINVDTEPG-ATCRIKVGSWTFDNKQLALITGEEGWNIAEYFDS-PKY 
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AspttS 





4' 



Plus side 
L1 

(ArgA15,AspAl7, 
ValA18. IleA19) 

L2 

(ThrA45) 

L4 

(AspA85) 

L5 

(l!aA92, SerA93 ( LysA94) 



L7 

(SerA122,A123,AspA124) 

L8 

(TrpA143.ThrAl44,WsA145, 
HIsAUB, GluAH9) 

L10 

(TyrA185, SerA186, CysA187) 



Minus side 
ot1, §3 

(ArgB3, AlaB4, LeuB7, 
TyrBB, ProB77) 

L9 

(AxgB170) 

(ProB100. 5 LeuB102) 
PI. 35. 06 

(UuB39. GluB96, 
LeuB98,ArgB116) 

P1.L9 

(AsnB37. TyrB168) 

a1,p2, (33, P5.B6 

(ArgB3. TrpB53, SerB75, 
ThrB99, ArgBIW, B114) 

L9 

(GluB183,TyrB164) 
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SEQUENCE LISTING 

<110> Stichting voor de Technische Wetenschappen 

<12 0> Water-soluble ligand-binding proteins and analogs of 
ligand-gated ion channels, crystals thereof and their 
use for screening ligands of ligand-gated ion channels 

<130> F 1105 PCT 

<140> 
<141> 

<160> 20 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 690 
<212> DNA 

<213> Lymnaea stagnalis 

<220> 

<221> CDS 

<222> (1) . . (687) 

<220> 

<221> mat„peptide 
<222> (58) . . (687) 

<400> 1 

atg cgt cga aac att ttc tgc ctt get tgt etc tgg ate gtg caa gcg 48 
Met Arg Arg Asn lie Phe Cys Leu Ala Cys Leu Trp lie Val Gin Ala 
-15 -10 -5 



tgt eta age ttg gac 
Cys Leu Ser Leu Asp 
-1 1 



egg gca gac ate ttg tac aac ata cgt cag aca 96 
Arg Ala Asp He Leu Tyr Asn He Arg Gin Thr 
5 10 
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teg aga ccg gat gtg att ccc aca cag cga gat cgc cca gtg gcg gtg 144 
Ser Arg Pro Asp Val lie Pro Thr Gin Arg Asp Arg Pro Val Ala Val 
15 20 25 

tec gtc tct ttg aag ttc ate aac ate ttg gaa gtg aat gaa ata acc 192 
Ser Val Ser Leu Lys Phe He Asn He Leu Glu Val Asn Glu He Thr 
30 35 40 45 

aat gaa gtg gac gtg gtc ttt tgg cag cag acg aca tgg teg gac agg 240 
Asn Glu Val Asp Val Val Phe Trp Gin Gin Thr Thr Trp Ser Asp Arg 
50 55 60 

acc etc gee tgg aac agt tct cac tea cca gat cag gtt tec gtg cca 288 
Thr Leu Ala Trp Asn Ser Ser His Ser Pro Asp Gin Val Ser Val Pro 
65 70 75 

ata age tct ttg tgg gtg cct gac etc get gca tac aac gee ate teg 33 6 
He Ser Ser Leu Trp Val Pro Asp Leu Ala Ala Tyr Asn Ala He Ser 
80 85 90 

aaa cct gaa gtc ctt aca ccg caa ctg gee agg gtc gta tec gat ggt 384 
Lys Pro Glu Val Leu Thr Pro Gin Leu Ala Arg Val Val Ser Asp Gly 
95 100 105 

gaa gtg ctg tac atg ccg agt ate cgc cag egg ttc tec tgc gat gta 432 
Glu Val Leu Tyr Met Pro Ser He Arg Gin Arg Phe Ser Cys Asp Val 
HO 115 120 125 

teg ggt gtc gat acg gag tec ggt get aca tgt egg ate aaa att ggt 480 
Ser Gly Val Asp Thr Glu Ser Gly Ala Thr Cys Arg He Lys He Gly 
130 135 140 

tec tgg acc cac cac agt aga gag att tct gta gat ccc acg aca gaa 528 
Ser Trp Thr His His Ser Arg Glu He Ser Val Asp Pro Thr Thr Glu 
145 150 155 

aat agt gat gat tct gaa tac ttc tec caa tac tct cgc ttt gaa ate 576 
Asn Ser Asp Asp Ser Glu Tyr Phe Ser Gin Tyr Ser Arg Phe Glu He 
160 165 170 
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ttg gac gtc aca cag aag aag aac teg gtt acc tac tct tgc tgt ccg 624 
Leu Asp Val Thr Gin Lys Lys Asn Ser Val Thr Tyr Ser Cys Cys Pro 
175 180 185 



gag gca tac gag gac gtt gaa gtg agt etc aat ttc egg aag aag gga 672 
Glu Ala Tyr Glu Asp Val Glu Val Ser Leu Asn Phe Arg Lys Lys Gly 
190 195 200 205 



cgc tec gaa att ctt tag 
Arg Ser Glu He Leu 
210 



690 



<210> 2 
<211> 229 
<212> PRT 

<213> Lymnaea stagnalis 



<400> 2 

Met Arg Arg Asn He Phe Cys Leu Ala Cys Leu Trp He Val Gin Ala 

-15 -10 -5 

Cys Leu Ser Leu Asp Arg Ala Asp He Leu Tyr Asn He Arg Gin Thr 

-11 5 10 

Ser Arg Pro Asp Val He Pro Thr Gin Arg Asp Arg Pro Val Ala Val 

15 20 25 

Ser Val Ser Leu Lys Phe He Asn He Leu Glu Val Asn Glu He Thr 

30 35 40 45 

Asn Glu Val Asp Val Val Phe Trp Gin Gin Thr Thr Trp Ser Asp Arg 

50 55 60 

Thr Leu Ala Trp Asn Ser Ser His Ser Pro Asp Gin Val Ser Val Pro 

65 70 75 

He Ser Ser Leu Trp Val Pro Asp Leu Ala Ala Tyr Asn Ala lie Ser 

80 85 . 90 



(4/20) 



Lys Pro Glu Val Leu Thr Pro Gin Leu Ala Arg Val Val Ser Asp Gly 
95 100 105 

Glu Val Leu Tyr Met Pro Ser lie Arg Gin Arg Phe Ser Cys Asp Val 
110 115 120 125 

Ser Gly Val Asp Thr Glu Ser Gly Ala Thr Cys Arg lie Lys lie Gly 
130 135 140 

Ser Trp Thr His His Ser Arg Glu He Ser Val Asp Pro Thr Thr Glu 
145 150 155 

Asn Ser Asp Asp Ser Glu Tyr Phe Ser Gin Tyr Ser Arg Phe Glu He 
160 165 170 

Leu Asp Val Thr Gin Lys Lys Asn Ser Val Thr Tyr Ser Cys Cys Pro 
175 180 185 

Glu Ala Tyr Glu Asp Val Glu Val Ser Leu Asn Phe Arg Lys Lys Gly 
190 195 200 205 

Arg Ser Glu He Leu 
210 



<210> 3 
<211> 690 
<212> DNA 

<213> Lymnaea stagnalis 

<220> 

<221> CDS 

<222> (1) . . (687) 

<220> 

<221> mat_jPeptide 
<222> (58) . . (687) 



<400> 3 
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atg cgt cga aac att ttc tgc ctt get tgt etc tgg ate gtg caa ggg 48 
Met Arg Arg Asn lie Phe Cys Leu Ala Cys Leu Trp He Val Gin Gly 
-15 -10 -5 

tgt eta age ttg gac egg gca gac ate ttg tac aac ata cgt cag aca 96 
Cys Leu Ser Leu Asp Arg Ala Asp He Leu Tyr Asn He Arg Gin Thr 
-11 5 10 

teg aga ccg gat gtg att ccc aca cag cga gat cgc cca gtg gcg gtg 144 
Ser Arg Pro Asp Val He Pro Thr Gin Arg Asp Arg Pro Val Ala Val 
15 20 25 

tec gtc tct ttg aag ttc ate aac ate ttg gaa gtg aat gaa ata acc 192 
Ser Val Ser Leu Lys Phe He Asn He Leu Glu Val Asn Glu He Thr 
30 35 40 45 

aat gaa gtg gac gtg gtc ttt tgg cag cag acg aca tgg teg gac agg 240 
Asn Glu Val Asp Val Val Phe Trp Gin Gin Thr Thr Trp Ser Asp Arg 
50 55 60 

acc etc gee tgg aac agt tct cac tea cca gat cag gtt tec gtg cca 288 
Thr Leu Ala Trp Asn Ser Ser His Ser Pro Asp Gin Val Ser Val Pro 
65 70 75 

ata age tct ttg tgg gtg cct gac etc get gca tac aac gec ate teg 336 
lie Ser Ser Leu Trp Val Pro Asp Leu Ala Ala Tyr Asn Ala He Ser 
80 85 90 

aaa cct gaa gtc ctt aca ccg caa ctg gec agg gtc gta tec gat ggt 3 84 
Lys Pro Glu Val Leu Thr Pro Gin Leu Ala Arg Val Val Ser Asp Gly 
95 100 105 

gaa gtg ctg tac atg ccg agt ate cgc cag egg ttc tec tgc gat gta 432 
Glu Val Leu Tyr Met Pro Ser lie Arg Gin Arg Phe Ser Cys Asp Val 
HO 115 120 125 



teg ggt gtc gat acg gag tec ggt get acg tgt egg ate aaa att ggt 
Ser Gly Val Asp Thr Glu Ser Gly Ala Thr Cys Arg He Lys He Gly 
130 135 140 



480 
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tec tgg acc cac cac agt gga gag att tct gta gat ccc acg aca gaa 528 
Ser Trp Thr His His Ser Gly Glu He Ser Val Asp Pro Thr Thr Glu 
145 150 155 

aat agt gat gat tct gaa tac ttc tec caa tac tct cgc ttt gaa ate 576 
Asn Ser Asp Asp Ser Glu Tyr Phe Ser Gin Tyr Ser Arg Phe Glu He 
160 165 170 

ttg gac gtc aca cag aag aag aac teg gtt ate tac tct tgc tgt ccg 624 
Leu Asp Val Thr Gin Lys Lys Asn Ser Val He Tyr Ser Cys Cys Pro 
175 180 185 

gag gca tac gag gac gtt gaa gtg agt etc aat ttc egg aag aag gga 672 
Glu Ala Tyr Glu Asp Val Glu Val Ser Leu Asn Phe Arg Lys Lys Gly 
190 195 200 205 

cgc tec gaa att ctt tag 590 
Arg Ser Glu He Leu 
210 



<210> 4 
<211> 229 
<212> PRT 

<213> Lymnaea stagnalis 
<400> 4 

Met Arg Arg Asn He Phe Cys Leu Ala Cys Leu Trp He Val Gin Gly 
-15 -10 -5 

Cys Leu Ser Leu Asp Arg Ala Asp He Leu Tyr Asn He Arg Gin Thr 
-1 1 5 io 

Ser Arg Pro Asp Val He Pro Thr Gin Arg Asp Arg Pro Val Ala Val 
15 20 25 

Ser Val Ser Leu Lys Phe He Asn He Leu Glu Val Asn Glu He Thr 
3 0 35 40 45 

Asn Glu Val Asp Val Val Phe Trp Gin Gin Thr Thr Trp Ser Asp Arg 



(7/20) 



50 



55 



60 



Thr Leu Ala Trp Asn Ser Ser His Ser Pro Asp Gin Val Ser Val Pro 
65 70 75 

lie Ser Ser Leu Trp Val Pro Asp Leu Ala Ala Tyr Asn Ala lie Ser 
80 85 90 

Lys Pro Glu Val Leu Thr Pro Gin Leu Ala Arg Val Val Ser Asp Gly 
95 100 105 

Glu Val Leu Tyr Met Pro Ser lie Arg Gin Arg Phe Ser Cys Asp Val 
110 115 120 125 

Ser Gly Val Asp Thr Glu Ser Gly Ala Thr Cys Arg lie Lys lie Gly 
130 135 140 

Ser Trp Thr His His Ser Gly Glu He Ser Val Asp Pro Thr Thr Glu 
145 150 155 

Asn Ser Asp Asp Ser Glu Tyr Phe Ser Gin Tyr Ser Arg Phe Glu He 
160 165 170 

Leu Asp Val Thr Gin Lys Lys Asn Ser Val He Tyr Ser Cys Cys Pro 
175 180 185 

Glu Ala Tyr Glu Asp Val Glu Val Ser Leu Asn Phe Arg Lys Lys Gly 
190 195 200 205 



Arg Ser Glu He Leu 
210 



<210> 5 
<211> 675 
<212> DNA 

<213> Bulinus truncatus 



<220> 
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<221> CDS 

<222> (1) . . (672) 

<220> 

<221> mat_peptide 
<222> (64) . . (672) 

<400> 5 

atg get gaa eta cga agg ate att ctt ctg eta tgt act att gec ttt 48 

Met Ala Glu Leu Arg Arg lie He Leu Leu Leu Cys Thr He Ala Phe 

-20 -15 -10 

cat gtt tec cat gga caa ata aga tgg acg ctg ctg aat cag ate acc 96 
His Val Ser His Gly Gin He Arg Trp Thr Leu Leu Asn Gin He Thr 
-5 -11 5 10 

ggt gaa tct gac gtc att ccg ctg tct aac aac acg ccc ctg aat gtg 144 
Gly Glu Ser Asp Val He Pro Leu Ser Asn Asn Thr Pro Leu Asn Val 
15 20 25 

teg ctg aat ttt aag ctg atg aat ate gta gag gcg gac aca gaa aaa 192 
Ser Leu Asn Phe Lys Leu Met Asn He Val Glu Ala Asp Thr Glu Lys 
30 35 40 

gat caa gtg gag gtc gtg ctg tgg aca cag get age tgg aaa gtg ccg 240 
Asp Gin Val Glu Val Val Leu Trp Thr Gin Ala Ser Trp Lys Val Pro 
45 50 55 

tat tac age tea ctg ctg tec tct age agt tta gac cag gtg age tta 2 88 
Tyr Tyr Ser Ser Leu Leu Ser Ser Ser Ser Leu Asp Gin Val Ser Leu 
60 65 70 75 

cca gtc age aaa atg tgg acc cca gac ctt tct ttc tac aac gee ate 33 6 
Pro Val Ser Lys Met Trp Thr Pro Asp Leu Ser Phe Tyr Asn Ala He 
80 85 90 



get gca ccc gag ttg 
Ala Ala Pro Glu Leu 
95 



etc tec gca gac cgc gtg gtg gtc tct aag gac 
Leu Ser Ala Asp Arg Val Val Val Ser Lys Asp 
100 105 



384 
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ggg age gtc att tac gtc ccc age cag agg gtc cgt ttc acc tgc gac 432 
Gly Ser Val lie Tyr Val Pro Ser Gin Arg Val Arg Phe Thr Cys Asp 
110 115 120 

ctt att aat gtc gac acg gag ccg gga gec acc tgt cgc ate aaa gtc 480 
Leu lie Asn Val Asp Thr Glu Pro Gly Ala Thr Cys Arg He Lys Val 
125 130 135 

gga tec tgg acc cac gac aac aaa cag ttc gee ctg ate acc ggg gag 528 
Gly Ser Trp Thr His Asp Asn Lys Gin Phe Ala Leu He Thr Gly Glu 
140 145 150 155 

gag ggg gtg gtg aat att gca gag tac ttc gac age cca aag ttt gac 576 
Glu Gly Val Val Asn He Ala Glu Tyr Phe Asp Ser Pro Lys Phe Asp 
160 165 170 

ctt ttg agt gee aca cag agt ctg aat cgc aag aag tac age tgt tgc 624 
Leu Leu Ser Ala Thr Gin Ser Leu Asn Arg Lys Lys Tyr Ser Cys Cys 
175 180 185 

gag aat atg tat gat gac att gaa att acc ttt gca ttc aga aag aag 672 
Glu Asn Met Tyr Asp Asp He Glu He Thr Phe Ala Phe Arg Lys Lys 
190 195 200 

taa 6 7 5 



<210> 6 
<211> 224 
<212> PRT 

<213> Bulinus truncatus 
<400> 6 

Met Ala Glu Leu Arg Arg He He Leu Leu Leu Cys Thr He Ala Phe 
-20 -15 -10 

His Val Ser His Gly Gin He Arg Trp Thr Leu Leu Asn Gin He Thr 
-5 -11 5 10 



Gly Glu Ser Asp Val He Pro Leu Ser Asn Asn Thr Pro Leu Asn Val 



(10/20) 



15 



20 



25 



Ser Leu Asn Phe Lys Leu Met Asn He Val Glu Ala Asp Thr Glu Lys 
30 35 40 

Asp Gin Val Glu Val Val Leu Trp Thr Gin Ala Ser Trp Lys Val Pro 
45 50 55 

Tyr Tyr Ser Ser Leu Leu Ser Ser Ser Ser Leu Asp Gin Val Ser Leu 
60 65 70 75 

Pro Val Ser Lys Met Trp Thr Pro Asp Leu Ser Phe Tyr Asn Ala He 
80 85 90 

Ala Ala Pro Glu Leu Leu Ser Ala Asp Arg Val Val Val Ser Lys Asp 
95 100 105 

Gly Ser Val He Tyr Val Pro Ser Gin Arg Val Arg Phe Thr Cys Asp 
110 115 120 

Leu He Asn Val Asp Thr Glu Pro Gly Ala Thr Cys Arg He Lys Val 
125 130 135 

Gly Ser Trp Thr His Asp Asn Lys Gin Phe Ala Leu lie Thr Gly Glu 
140 145 150 155 

Glu Gly Val Val Asn He Ala Glu Tyr Phe Asp Ser Pro Lys Phe Asp 
160 165 170 

Leu Leu Ser Ala Thr Gin Ser Leu Asn Arg Lys Lys Tyr Ser Cys Cys 
175 180 185 



Glu Asn Met Tyr Asp Asp He Glu He Thr Phe Ala Phe Arg Lys Lys 
190 195 200 



<210> 7 
<2U> 675 
<212> DNA 
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<213> Bulinus truncatus 



<220> 

<221> CDS 

<222> (1) . . (672) 



<220> 

<221> mat_peptide 
<222> (64) . . (672) 



<400> 7 

atg get gaa eta cga ggg ate att ctt ctg eta tgt act att gee ttt 48 
Met Ala Glu Leu Arg Gly lie He Leu Leu Leu Cys Thr He Ala Phe 
-20 -15 -10 



cat gtt tec cat gga caa ata aga tgg acg ctg ctg aat cag ate acc 96 
His Val Ser His Gly Gin He Arg Trp Thr Leu Leu Asn Gin He Thr 
-5 -11 5 10 

ggt gaa tct gac gtc att ccg ctg tct aac aac acg cca ctg aat gtg 144 
Gly Glu Ser Asp Val He Pro Leu Ser Asn Asn Thr Pro Leu Asn Val 
15 20 25 



teg ctg aat ttt aag ctg atg aat ate tta gag gcg gac aca gag aaa 192 
Ser Leu Asn Phe Lys Leu Met Asn He Leu Glu Ala Asp Thr Glu Lys 
30 35 40 

gat caa gtg gag gtc gtg. ctg tgg aca cag get age tgg aaa gtg ccg 240 
Asp Gin Val Glu Val Val Leu Trp Thr Gin Ala Ser Trp Lys Val Pro 
45 50 55 

tat tac age tea ctg ctg tec tct age agt tta gac cag gtg age tta 2 88 
Tyr Tyr Ser Ser Leu Leu Ser Ser Ser Ser Leu Asp Gin Val Ser Leu 
60 65 70 75 

cca gee age aaa atg tgg acc cca gac ctt tct ttc tat aac gee ate 33 6 
Pro Ala Ser Lys Met Trp Thr Pro Asp Leu Ser Phe Tyr Asn Ala He 
80 85 90 

get gca ccc gag ttg etc tec aca gac cgc gtg gtg gtc tct aag gac 384 



(12/20) 



Ala Ala Pro Glu Leu Leu Ser Thr Asp Arg Val Val Val Ser Lys Asp 

95 100 105 

ggg age gtc att tac gtg ccc age cag agg gtc cgt ttc acc tgc gac 432 

Gly Ser Val lie Tyr Val Pro Ser Gin Arg Val Arg Phe Thr Cys Asp 

110 115 120 

ctt att aat gtg gac acg gag ccg gga gec acc tgt cgc ate aaa gtc 480 

Leu He Asn Val Asp Thr Glu Pro Gly Ala Thr Cys Arg He Lys Val 

125 130 135 



gga tec tgg acc ttc gac aac aaa cag etc gee ctg ate acc ggg gag 528 

Gly Ser Trp Thr Phe Asp Asn Lys Gin Leu Ala Leu He Thr Gly Glu 

140 145 150 155 

gag ggg gtg gtg aat att gca gag tac ttc gac age cca aag tac gac 57 6 

Glu Gly Val Val Asn He Ala Glu Tyr Phe Asp Ser Pro Lys Tyr Asp 
160 165 170 

ctt ttg agt gee aca cag agt ctg aat cgc aag aag tac aga tgt tgc 624 

Leu Leu Ser Ala Thr Gin Ser Leu Asn Arg Lys Lys Tyr Arg Cys Cys 
175 180 185 

gag aat atg tat gaa gac att gaa att acc ttt gca ttc aga aag aag 672 

Glu Asn Met Tyr Glu Asp He Glu He Thr Phe Ala Phe Arg Lys Lys 
190 195 200 

taa 675 



<210> 8 
<211> 224 
<212> PRT 

<213> Bulinus truncatus 
<400> 8 

Met Ala Glu Leu Arg Gly He He Leu Leu Leu Cys Thr He Ala Phe 
-20 -15 -10 

His Val Ser His Gly Gin He Arg Trp Thr Leu Leu Asn Gin He Thr 



(13/20) 

"5 -11 5 10 

Gly Glu Ser Asp Val He Pro Leu Ser Asn Asn Thr Pro Leu Asn Val 
15 . 20 25 

Ser Leu Asn Phe Lys Leu Met Asn He Leu Glu Ala Asp Thr Glu Lys 
30 35 40 

Asp Gin Val Glu Val Val Leu Trp Thr Gin Ala Ser Trp Lys Val Pro 
45 50 55 



Tyr Tyr Ser Ser Leu Leu Ser Ser Ser Ser Leu Asp Gin Val Ser Leu 
60 65 70 75 

Pro Ala Ser Lys Met Trp Thr Pro Asp Leu Ser Phe Tyr Asn Ala He 
80 85 90 

Ala Ala Pro Glu Leu Leu Ser Thr Asp Arg Val Val Val Ser Lys Asp 
95 100 105 

Gly Ser Val He Tyr Val Pro Ser Gin Arg Val Arg Phe Thr Cys Asp 
HO 115 120 

Leu He Asn Val Asp Thr Glu Pro Gly Ala Thr Cys Arg He Lys Val 
125 130 135 

Gly Ser Trp Thr Phe Asp Asn Lys Gin Leu Ala Leu He Thr Gly Glu 
140 145 150 155 

Glu Gly Val Val Asn He Ala Glu Tyr Phe Asp Ser Pro Lys Tyr Asp 
160 165 170 

Leu Leu Ser Ala Thr Gin Ser Leu Asn Arg Lys Lys Tyr Arg Cys Cys 
175 180 185 



Glu Asn Met Tyr Glu Asp He Glu He Thr Phe Ala Phe Arg Lys Lys 
190 195 200 



(14/20) 



<210> 9 
<211> 502 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> DOMAIN 
<222> (1) . . (235) 

<400> 9 

Met Arg Cys Ser Pro Gly Gly Val Trp Leu Ala Leu Ala Ala Ser Leu 
15 10 15 

Leu His Val Ser Leu Gin Gly Glu Phe Gin Arg Lys Leu Tyr Lys Glu 
20 25 30 

Leu Val Lys Asn Tyr Asn Pro Leu Glu Arg Pro Val Ala Asn Asp Ser 
35 40 45 

Gin Pro Leu Thr Val Tyr Phe Ser Leu Ser Leu Leu Gin lie Met Asp 
50 55 60 

Val Asp Glu Lys Asn Gin Val Leu Thr Thr Asn lie Trp Leu Gin Met 
65 70 75 80 

Ser Trp Thr Asp His Tyr Leu Gin Trp Asn Val Ser Glu Tyr Pro Gly 
85. 90 95 

Val Lys Thr Val Arg Phe Pro Asp Gly Gin He Trp Lys Pro Asp He 
100 105 HO 

Leu Leu Tyr Asn Ser Ala Asp Glu Arg Phe Asp Ala Thr Phe His Thr 
115 120 125 

Asn Val Leu Val Asn Ser Ser Gly His Cys Gin Tyr Leu Pro Pro Gly 
130 135 140 



He Phe Lys Ser Ser Cys Tyr He Asp Val Arg Trp Phe Pro Phe Asp 
145 150 155 160 



(15/20) 



Val Gin His Cys Lys Leu Lys Phe Gly Ser Tip Ser Tyr Gly Gly Trp 
165 170 175 

Ser Leu Asp Leu Gin Met Gin Glu Ala Asp He Ser Gly Tyr He Pro 
180 185 190 

Asn Gly Glu Trp Asp Leu Val Gly He Pro Gly Lys Arg Ser Glu Arg 
195 200 205 

Phe Tyr Glu Cys Cys Lys Glu Pro Tyr Pro Asp Val Thr Phe Thr Val 
210 215 220 

Thr Met Arg Arg Arg Thr Leu Tyr Tyr Gly Leu Asn Leu Leu He Pro 
225 230 235 240 

Cys Val Leu He Ser Ala Leu Ala Leu Leu Val Phe Leu Leu Pro Ala 
245 250 255 

Asp Ser Gly Glu Lys He Ser Leu Gly He Thr Val Leu Leu Ser Leu 
260 265 270 

Thr Val Phe Met Leu Leu Val Ala Glu lie Met Pro Ala Thr Ser Asp 
275 280 285 

Ser Val Pro Leu He Ala Gin Tyr Phe Ala Ser Thr Met lie He Val 
290 295 300 

Gly Leu Ser Val Val Val Thr Val He Val Leu Gin Tyr His His His 
305 310 315 320 

Asp Pro Asp Gly Gly Lys Met Pro Lys Trp Thr Arg Val He Leu Leu 
325 330 335 

Asn Trp Cys Ala Trp Phe Leu Arg Met Lys Arg Pro Gly Glu Asp Lys 
340 345 350 



Val Arg Pro Ala Cys Gin His Lys Gin Arg Arg Cys Ser Leu Ala Ser 
355 360 365 



Val Glu Met Ser Ala Val Ala Pro Pro Pro Ala Ser Asn Gly Asn Leu 



(16/20) 



370 375 380 

Leu Tyr He Gly Phe Arg Gly Leu Asp Gly Val His Cys Val Pro Thr 
385 390 395 400 

Pro Asp Ser Gly Val Val Cys Gly Arg Met Ala Cys Ser Pro Thr His 
405 410 415 

Asp Glu His Leu Leu His Gly Gly Gin Pro Pro Glu Gly Asp Pro Asp 
420 425 430 

Leu Ala Lys He Leu Glu Glu Val Arg Tyr He Ala Asn Arg Phe Arg 
435 440 445 

Cys Gin Asp Glu Ser Glu Ala Val Cys Ser Glu Trp Lys Phe Ala Ala 
450 455 460 

Cys Val Val Asp Arg Leu Cys Leu Met Ala Phe Ser Val Phe Thr He 
465 470 475 480 

He Cys Thr He Gly lie Leu Met Ser Ala Pro Asn Phe Val Glu Ala 
485 490 495 

Val Ser Lys Asp Phe Ala 
500 



<210> 10 
<211> 10 
<212> PRT 

<2i3> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: N-terminus of 
mature LAChBPl 

<400> 10 

Leu Asp Arg Ala Asp He Leu Tyr Asn He 
15 10 



(17/20) 



<210> 11 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 

Oligonucleotides encoding N-terminal peptide of 
LAChBPl 

<220> 

<221> modif ied_base 
<222> (13) 
<223> i 

<400> 11 

cggatccgay mgagcngaya thytntayaa ya 

<210> 12 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primerl useful 
for cloning cDNA encoding LAChBP (optionally with 
Primer2) 

<220> 

<221> modif ied_base 
<222> (14) 
<223> i 

<220> 

<221> modif ied_Jbase 
<222> (20) 
<223> i 



(18/20) 



<400> 12 

gcgaattcga yacagarwsa ggngcnacnt g 

<210> 13 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer2 
useful for cloning cDNA encoding LAChBP 
(optionally with Primerl) 

<220> 

<221> modif ied_base 
<222> (20) 
<223> i 

<400> 13 

gcgaagcttc rtcytcrtaa gcytcngcrc arc 

<210> 14 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: His-tag 
<400> 14 

Ser Arg Gly His His His His His His 
1 5 

<210> 15 
<211> 14 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: His-tag 
<400> 15 

Glu Phe Lys Asp Asp Asp Asp Lys His His His His His His 
15 10 

<210> 16 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Additonal 

amino acids at the N~terminus of mature LAChBP due 
to alpha-mating factor cleavage site 

<400> 16 
Glu Ala Glu Ala 
1 

<210> 17 
<211> 47 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer useful 
for generating LAChBP / alpha7 nAChR chimera 

<400> 17 

gcgctcgaga aaagagaggc tgaagctttg gaccgggcag acatctt 

<210> 18 
<211> 30 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer useful 
for generating LAChBP/alpha7 nACHR chimera 

<400> 18 

cgcgaattca agaatttcgg agcgtccctt 

<210> 19 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer useful 
for generating LAChBP/alpha7 nACHR chimera 

<400> 19 

gtggaaacca gacattctcc tctacaacgc catctcgaaa cc 

<210> 20 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer useful 
for generating LAChBP/alpha7 nACHR chimera 

<400> 20 

gaggagaatg tctggtttcc acaaagagct tattggcac 
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